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Preface 


This is an introduction to ordinary differential equations. We describe the main ideas to 
solve certain differential equations, such us first order scalar equations, second order linear 
equations, and systems of linear equations. We use power series methods to solve variable 
coefficients second order linear equations. We introduce Laplace transform methods to find 
solutions to constant coefficients equations with generalized source functions. We provide 
a brief introduction to boundary value problems, eigenvalue-eigenfunction problems, and 
Fourier series expansions. We end these notes solving our first partial differential equation, 
the heat equation. We use the method of separation of variables, where solutions to the 
partial differential equation are obtained by solving infinitely many ordinary differential 
equations. 


CHAPTER 1 


First Order Equations 


We start our study of differential equations in the same way the pioneers in this field did. 
We show particular techniques to solve particular types of first order differential equations. 
The techniques were developed in the eighteenth and nineteenth centuries and the equations 
include linear equations, separable equations, Euler homogeneous equations, and exact equa- 
tions. This way of studying differential equations reached a dead end pretty soon. Most of 
the differential equations cannot be solved by any of the techniques presented in the first 
sections of this chapter. People then tried something different. Instead of solving the equa- 
tions they tried to show whether an equation has solutions or not, and what properties such 
solution may have. This is less information than obtaining the solution, but it is still valu- 
able information. The results of these efforts are shown in the last sections of this chapter. 
We present theorems describing the existence and uniqueness of solutions to a wide class of 
first order differential equations. 


4 1. FIRST ORDER EQUATIONS 


1.1. Linear Constant Coefficient Equations 


1.1.1. Overview of Differential Equations. A differential equation is an equation, 
where the unknown is a function and both the function and its derivatives may appear in 
the equation. Differential equations are essential for a mathematical description of nature— 
they lie at the core of many physical theories. For example, let us just mention Newton’s 
and Lagrange’s equations for classical mechanics, Maxwell’s equations for classical electro- 
magnetism, Schrédinger’s equation for quantum mechanics, and Einstein’s equation for the 
general theory of gravitation. We now show what differential equations look like. 


Example 1.1.1. 

(a) Newton’s law: Mass times acceleration equals force, ma = f, where m is the particle 
mass, a = d?x/dt? is the particle acceleration, and f is the force acting on the particle. 
Hence Newton’s law is the differential equation 

d?x dx 
mo (t) = (t, a(¢ — t)), 
=F (tal), FW) 
where the unknown is a(t)—the position of the particle in space at the time t. As we 
see above, the force may depend on time, on the particle position in space, and on the 
particle velocity. 


Remark: This is a second order Ordinary Differential Equation (ODE). 


(b) Radioactive Decay: The amount u of a radioactive material changes in time as follows, 


du 
—(t) = —ku(t k 
c(t) =—kult), b>, 


where & is a positive constant representing radioactive properties of the material. 
Remark: This is a first order ODE. 


(c) The Heat Equation: The temperature T in a solid material changes in time and in 
three space dimensions—labeled by x = (x, y, z)—according to the equation 
oT all or oT ) 


ght) = Galha) + Zalhe) + alba) k >0, 


where & is a positive constant representing thermal properties of the material. 
Remark: This is a first order in time and second order in space PDE. 


(d) The Wave Equation: A wave perturbation u propagating in time ¢ and in three space 
dimensions—labeled by x = (x, y, z)—through the media with wave speed v > 0 is 


Pu oO? O7u Ou 


Remark: This is a second order in time and space Partial Differential Equation (PDE). 


The equations in examples (a) and (b) are called ordinary differential equations (ODE)— the 
unknown function depends on a single independent variable, t. The equations in examples 
(d) and (c) are called partial differential equations (PDE)—the unknown function depends 
on two or more independent variables, t, x, y, and z, and their partial derivatives appear in 
the equations. 

The order of a differential equation is the highest derivative order that appears in the 
equation. Newton’s equation in example (a) is second order, the time decay equation in 
example (b) is first order, the wave equation in example (d) is second order is time and 
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space variables, and the heat equation in example (c) is first order in time and second order 
in space variables. 


1.1.2. Linear Differential Equations. We start with a precise definition of a first 
order ordinary differential equation. Then we introduce a particular type of first order 
equations—linear equations. 


Definition 1.1.1. A first order ODE on the unknown y is 
y(t) = f(t,y@), (1.1.1) 


d : . ee 
where f is given andy! = The equation is linear iff the source function f is linear on 


its second argument, 
y =a(t)y+t W(t). (14,2) 


The linear equation has constant coefficients iff both a and b above are constants. Oth- 
erwise the equation has variable coefficients. 


There are different sign conventions for Eq. (1.1.2) in the literature. For example, Boyce- 
DiPrima [3] writes it as y/ = —ay+b. The sign choice in front of function a is matter of 
taste. Some people like the negative sign, because later on, when they write the equation 
as y’ +ay = b, they get a plus sign on the left-hand side. In any case, we stick here to the 
convention y’ = ay + b. 


Example 1.1.2. 


(a) An example of a first order linear ODE is the equation 
y =2y+3. 


On the right-hand side we have the function f(t, y) = 2y + 3, where we can see that 
a(t) = 2 and b(t) = 3. Since these coefficients do not depend on ¢, this is a constant 
coefficient equation. 

(b) Another example of a first order linear ODE is the equation 


f 


2 
y = 


In this case, the right-hand side is given by the function f(t, y) = —2y/t + 4t, where 
a(t) = —2/t and b(t) = 4t. Since the coefficients are nonconstant functions of t, this is 
a variable coefficients equation. 


2 
(c) The equation y/ = es 4t is nonlinear. 
y 
dq 
We denote by y: D C R > R a real-valued function y defined on a domain D. Such 
a function is solution of the differential equation (1.1.1) iff the equation is satisfied for all 
values of the independent variable t on the domain D. 


3 
Example 1.1.3. Show that y(t) =e” — 5 is solution of the equation 


y =2y+3. 
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Solution: We need to compute the left and right-hand sides of the equation and verify they 
agree. On the one hand we compute y’(t) = 2e7¢. On the other hand we compute 


3 
2y(t)+3= 2(e es =) 43 = 2%, 
We conclude that y’(t) = 2y(t) + 3 for allt ER. <J 


Example 1.1.4. Find the differential equation y’ = f(y) satisfied by y(t) = 4e” + 3. 
Solution: (Solution Video) We compute the derivative of y, 
y’ —8 e2t 
We now write the right-hand side above, in terms of the original function y, that is, 
yo4e*+3 => y-3=4e% > 2%Wy—3)=Be™. 
So we got a differential equation satisfied by y, namely 
y = 2y—-6. 
dq 
1.1.3. Solving Linear Differential Equations. Linear equations with constant co- 


efficient are simpler to solve than variable coefficient ones. But integrating each side of the 
equation does not work. For example, take the equation 


y = 2y +3, 


and integrate with respect to ¢ on both sides, 
[u@a=2fyasstre ceER. 


The Fundamental Theorem of Calculus implies y(t) = f y’(t) dt, so we get 


y(t) = 2 f ylt)dt + 3t+e. 


Integrating both sides of the differential equation is not enough to find a solution y. We 
still need to find a primitive of y. We have only rewritten the original differential equation 
as an integral equation. Simply integrating both sides of a linear equation does not solve 
the equation. 

We now state a precise formula for the solutions of constant coefficient linear equations. 
The proof relies on a new idea—a clever use of the chain rule for derivatives. 


Theorem 1.1.2 (Constant Coefficients). The linear differential equation 


y =ayt+b (1.1.3) 
with a £0, 6 constants, has infinitely many solutions, 
b 
y(t) =ce™ — -, ceER. (1.1.4) 
a 


Remarks: 


(a) Equation (1.1.4) is called the general solution of the differential equation in (1.1.3). 


(b) Theorem 1.1.2 says that Eq. (1.1.3) has infinitely many solutions, one solution for each 
value of the constant c, which is not determined by the equation. 
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(c) It makes sense that we have a free constant c in the solution of the differential equa- 
tion. The differential equation contains a first derivative of the unknown function y, 
so finding a solution of the differential equation requires one integration. Every indefi- 
nite integration introduces an integration constant. This is the origin of the constant c 
above. 


Proof of Theorem 1.1.2: First consider the case b = 0, so y’! = ay, with a € R. Then, 


ysay + T=a = In(lyl/=a + Illy) =attea, 
where Cp € R is an arbitrary integration constant, and we used the Fundamental Theorem 


of Calculus on the last step, { In(|y|)’ dt = In(|y|). Compute the exponential on both sides, 


y(t) = tet —~+e%e%, denote c=te® => y(t)=ce™%, cER. 


This is the solution of the differential equation in the case that b = 0. The case b 4 0 can 
be converted into the case above. Indeed, 


j ; b bs! b 
y=aytb = y=a(y+-) = (y+-) =a(y+-), 
a a a 
since (b/a)’ = 0. Denoting y = y + (b/a), the equation above is 7’ = ay. We know all the 
solutions to that equation, 


t 


b b 
g(t) =ce", cEeR = y(t) + — = cet => y(t)=ce”—-. 


This establishes the Theorem. 


Remark: We solved the differential equation above, y’ = ay, by transforming it into a 
total derivative. Let us highlight this fact in the calculation we did, 


In(lyl)' =a + (In(lyl) at)’ =0 & vty) =0, with Y=In(\y(Z)|) at. 
The function w is called a potential function. This is how the original differential equation 
gets transformed into a total derivative, 


y=ay > wW=0. 
Total derivatives are simple to integrate, 
wy’ =0 > wh = Co, GER. 


So the solution is 


In(\y|)-—at=c => In(ly)=atat = y(t) = te = +e e™, 


and denoting c = +e we reobtain the formula 


y(t) =ce™. 


b 
In the case b £ 0 a potential function is w(t, y(t)) = In(|y(t) + -|) — at. 


Example 1.1.5. Find all solutions to the constant coefficient equation y! = 2y + 3. 
Solution: (Solution Video) Let’s pull a common factor 2 on the right-hand side of the 


equation, 
=2(y45) = (v+5) =2(v+5) 
yY=4\Y 5) y a) y a)" 
Denoting 7 = y + (3/2) we get 
~/ 


i =29 a 2 In(|g|)’ =2 => In(|gl) =2t+0. 
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We now compute exponentials on both sides, to get 


@(t) = ze?*+0 = te% ce, denote c=+e™, then g(t)=ce™*, ceER. 
- 3 3 
Since y= y+ 5? we get y(t) = ce’! — 5, where c € R. <J 


Remark: We converted the original differential equation y’ = 2y+3 into a total derivative 
of a potential function 7)’ = 0. The potential function can be computed from the step 


In({g))’ =2 => — (In({g|) — 2)’ = 0, 


3 
then a potential function is w(t, y(t)) = In([y(t) + 1) — 2t. Since the equation is now 
w’ = 0, all solutions are = c, with c. € R. That is 


In(|y(t) + 


bereteo | 


No] 


3 3 
S|) —2t= 6 In([y(t)+5]) =2t+e => y(t) + 


If we denote c = te, then we get the solution we found above, y(t) = ce” — 

1.1.4. The Integrating Factor Method. The argument we used to prove Theo- 
rem 1.1.2 cannot be generalized in a simple way to all linear equations with variable coef- 
ficients. However, there is a way to solve linear equations with both constant and variable 
coefficients—the integrating factor method. Now we give a second proof of Theorem 1.1.2 
using this method. 


Second Proof of Theorem 1.1.2: Write the equation with y on one side only, 


y—ay=b, 
and then multiply the differential equation by a function p, called an integrating factor, 
py’ —apy = pb. (1.1.5) 
Now comes the critical step. We choose a positive function yz such that 
—ap=yp. (1.1.6) 


For any function yz solution of Eq. (1.1.6), the differential equation in (1.1.5) has the form 
py +py =p. 
But the left-hand side is a total derivative of a product of two functions, 
/ 
(uy) =p. (1.1.7) 


This is the property we want in an integrating factor, w. We want to find a function ys such 
that the left-hand side of the differential equation for y can be written as a total derivative, 
just as in Eq. (1.1.7). We only need to find one of such functions . So we go back to 
Eq. (1.1.6), the differential equation for 41, which is simple to solve, 


we=map > Ea -a > (In(lul))’=-a + In(lpl) =-at +6 


Computing the exponential of both sides in the equation above we get 


co—at co ,—at at 


+e“ e => p=ae™, c, = te. 


jo= xe 


Since c, is a constant which will cancel out from Eq. (1.1.5) anyway, we choose the integration 
constant Co = 0, hence c, = 1. The integrating function is then 


noe, 
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This function is an integrating factor, because if we start again at Eq. (1.1.5), we get 
—at if 


ey —-ae “y=be* => |e My + (e~")'y =be™, 


where we used the main property of the integrating factor, -ae~“ = (ey. Now the 
product rule for derivatives implies that the left-hand side above is a total derivative, 


(ene y). =be, 


b / 
The right-hand side above can be rewritten as a derivative, be~*! = (-- ee) , hence 
a 


/ 


(* y+ oat) =0 [(y + *) ev] =0. 


We have succeeded in writing the whole differential equation as a total derivative. The 
differential equation is the total derivative of a potential function, which in this case is 


v(t,y) = (y + 3 aa 


Notice that this potential function is the exponential of the potential function found in the 
first proof of this Theorem. The differential equation for y is a total derivative, 


F(t ult)) =0, 


so it is simple to integrate, 
b b 
vey))=c > (y+ 2)eM=c > y(t)=ce™——. 


This establishes the Theorem. 
We solve the example below following the second proof of Theorem 1.1.2. 


Example 1.1.6. Find all solutions to the constant coefficient equation 


y! = 24 +3 (1.1.8) 


Solution: (Solution Video) Write the equation in (1.1.8) as follows, 
y! — dy = 3. 


—2t 
’ 


Multiply this equation by the integrating factor y(t) = e 
ety! —2e7%t y= 3602 es ety! a (e-*)'y =3e-%, 
We now solve the same problem above, but now using the formulas in Theorem 1.1.2. 
Example 1.1.7. Find all solutions to the constant coefficient equation 


y! =24y +3 (1.1.9) 


Solution: The equation above is the case of a = 2 and b = 3 in Eq. (1.1.3). Therefore, 
using these values in the expression for the solution given in Eq. (1.1.4) we obtain 
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The equation on the far right above is 
eo y)! = 3e77t. 
Rewrite the right-hand side above, 
3 / 
leo y)! = (-5 e*) : 


Moving terms and reordering factors we get 


[oer] 0 


The equation is a total derivative, ~’ = 0, of 
the potential function 


v(t,y) = (y + >) a 


Now the equation is easy to integrate, 


3 
(y + =) eae 


FiGuRE 1. A few solutions 
So we get the solutions to Eq. (1.1.8) for different c. 
(Interactive Graph) 


y(t) = ce — —, ceER. J 


1.1.5. The Initial Value Problem. Sometimes in physics one is not interested in all 
solutions to a differential equation, but only in those solutions satisfying extra conditions. 
For example, in the case of Newton’s second law of motion for a point particle, one could 
be interested only in solutions such that the particle is at a specific position at the initial 
time. Such condition is called an initial condition, and it selects a subset of solutions of the 
differential equation. An initial value problem means to find a solution to both a differential 
equation and an initial condition. 


Definition 1.1.3. The initial value problem (IVP) is to find all solutions y to 
y’ =ayt+d, (1.1.10) 
that satisfy the initial condition 


y(to) = Yor 4550) 


where a, b, to, and Yo are given constants. 


Remark: The equation (1.1.11) is called the initial condition of the problem. 


Although the differential equation in (1.1.10) has infinitely many solutions, the associ- 
ated initial value problem has a unique solution. 


Theorem 1.1.4 (Constant Coefficients IVP). Given the constants a,b, to, yo € R, with 
a #0, the initial value problem 


yi =aytb, — y(to) = Yo, 
has the unique solution 


b b 
u(t) = (vo + a ealtto) 7 (1.1.12) 
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Remark: In case t, = 0 the initial condition is y(0) = yo and the solution is 


b b 

u(t) = (yo +) et =. 

a a 
The proof of Theorem 1.1.4 is just to write the general solution of the differential 
equation given in Theorem 1.1.2, and fix the integration constant c with the initial condition. 


Proof of Theorem 1.1.4: The general solution of the differential equation in (1.1.10) is 


given in Eq. (1.1.4) for any choice of the integration constant c, 
b 

th= ae 
y(t) = ces — — 


The initial condition determines the value of the constant c, as follows 


b b 
Yo = y(to) —cet _ = “4 c= (u 4 = e- ato, 
o a 
Introduce this expression for the constant c into the differential equation in Eq. (1.1.10), 
b b 


u(t) = (yo + =)eatete) — =. 
a a 


This establishes the Theorem. 


Example 1.1.8. Find the unique solution of the initial value problem 
y =2y+3, y(0) = 1. (1.1.13) 


Solution: (Solution Video) All solutions of the differential equation are given by 


3 
et 2 
y(t) = ce" — 5, 
where c is an arbitrary constant. The initial condition in Eq. (1.1.13) determines c, 
3 5 
1 — 0 — —— => =, 
yO) =e, ak 


No] we 
A 


5) 
Then, the unique solution to the initial value problem above is y(t) = 5 ge a 


Example 1.1.9. Find the solution y to the initial value problem 
y=-3y+1, = y(0)=1. 
Solution: (Solution Video) Write the differential equation as y’ + 3y = 1, and multiply 


the equation by the integrating factor 4 = e®', which will convert the left-hand side above 
into a total derivative, 


erty! + 363 y = Ss tty! + (*)'y =e, 


This is the key idea, because the derivative of a product implies 


The exponential e*’ is an integrating factor. Integrate on both sides of the equation, 


1 
3t 3t 
e = =e °C. 
a3 
So every solution of the differential equation above is given by 


1 
y(t) =ce™ +3, ceER. 
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The initial condition y(0) = 2 selects only one solution, 


1 2 
1=y(0) = =~ => =n. 
y(0) cts c=. 


2 il 
We get the solution y(t) = 3 ey 3° <1 


Notes. This section corresponds to Boyce-DiPrima [3] Section 2.1, where both constant 
and variable coefficient equations are studied. Zill and Wright give a more concise exposition 
in [17] Section 2.3, and a one page description is given by Simmons in [10] in Section 2.10. 
The integrating factor method is shown in most of these books, but unlike them, here we 
emphasize that the integrating factor changes the linear differential equation into a total 
derivative, which is trivial to integrate. We also show here how to compute the potential 
functions for the linear differential equations. In § 1.4 we solve (nonlinear) exact equations 
and nonexact equations with integrating factors. We solve these equations by transforming 
them into a total derivative, just as we did in this section with the linear equations. 


1.1. LINEAR CONSTANT COEFFICIENT EQUATIONS 13 


1.1.6. Exercises. 


1.1.1.- Find the differential equation of the 
form y’ = f(y) satisfied by the function 
2 


t) = 8e — =. 
y(t) = Be" — = 


1.1.2.- Find constants a, 6, so that 
y(t) = (t+3) 6" 
is solution of the IVP 
yi =ay+e™%, (0) =b. 
1.1.3.- Find all solutions y of 
y = 3y. 


1.1.4.- Follow the steps below to find all so- 
lutions of 
y =—4y +2 
(a) Find the integrating factor p. 
(b) Write the equations as a total de- 
rivative of a function ~w, that is 


wy’ =0. 


(c) Integrate the equation for w. 
(d) Compute y using part (c). 


y =—4y+2 ¢ 


1.1.5.- Find all solutions of 
y =2y+5 


1.1.6.- Find the solution of the IVP 
y = —4y +2, y(0) = 5. 


1.1.7.- Find the solution of the IVP 


Y(t) =3y(t)-2, yl) =1. 


1.1.8.- Express the differential equation 
y =6y+1 (1.1.14) 
as a total derivative of a potential func- 
tion 2(t, y), that is, find ~ satisfying 
y =6y4+1 6 yw’ =0. 
Integrate the equation for the poten- 


tial function ~ to find all solutions y of 
Eq. (1.1.14). 


1.1.9.- Find the solution of the IVP 
y =6y+1, y(0)=1. 


1.1.10.- * Follow the steps below to solve 
y =—3y+5, y(0)=1. 

(a) Find any integrating factor pu for 
the differential equation. 

(b) Write the differential equation as a 
total derivative of a potential func- 
tion wp. 

(c) Use the potential function to find 
the general solution of the differen- 
tial equation. 

(d) Find the solution of the initial value 
problem above. 
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1.2. Linear Variable Coefficient Equations 


In this section we obtain a formula for the solutions of variable coefficient linear equations, 
which generalizes Equation (1.1.4) in Theorem 1.1.2. To get this formula we use the integrat- 
ing factor method—already used for constant coefficient equations in § 1.1. We also show 
that the initial value problem for variable coefficient equations has a unique solution—just 
as happens for constant coefficient equations. 

In the last part of this section we turn our attention to a particular nonlinear differential 
equation—the Bernoulli equation. This nonlinear equation has a particular property: it can 
be transformed into a linear equation by an appropriate change in the unknown function. 
Then, one solves the linear equation for the changed function using the integrating factor 
method. The last step is to transform the changed function back into the original function. 


1.2.1. Review: Constant Coefficient Equations. Let us recall how we solved the 
constant coefficient case. We wrote the equation y’ = ay + b as follows 


faa(v+2), 


The critical step was the following: since b/a is constant, then (b/a)’ = 0, hence 


by! b 
(r+ 8) =a(v+8) 
a a 
At this point the equation was simple to solve, 
+ 4) b 
UT ad - In(|y rn 
(ga) a 


We now computed the exponential on both sides, to get 


! b 
) =a => In(|y + =|) =) +at. 
a 


ly + °| = erat — ete Se yt 2 = Gee", 
a a 


and calling c = te® we got the formula 


This idea can be generalized to variable coefficient equations, but only in the case where 
b/a is constant. For example, consider the case b = 0 and a depending on t. The equation 
is y’ = a(t) y, and we can solve it as follows, 


5 =a(t) => In(lyl)’=at) = In(ly(t)|) = AG) +o, 
where A = [ adt, is a primitive or antiderivative of a. Therefore, 
y(t) = teAM +00 = $e4M 0, 
so we get the solution y(t) = ce4, where c = +e. 


Example 1.2.1. The solutions of y’ = 2ty are y(t) = cet, where cE R. 


However, the case where b/a is not constant is not so simple to solve—we cannot add 
zero to the equation in the form of 0 = (b/a)’. We need a new idea. We now show an idea 
that works with all first order linear equations with variable coefficients—the integrating 
factor method. 
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1.2.2. Solving Variable Coefficient Equations. We now state our main result—the 
formula for the solutions of linear differential equations with variable coefficiets. 


Theorem 1.2.1 (Variable Coefficients). If the functions a, b are continuous, then 
y =a(t)y+d(0), (1.2.1) 


has infinitely many solutions given by 
y(t) = ce4® + eA® / eA) b(t) de, (1.2.2) 
where A(t) = f a(t) dt andceER. 


Remarks: 


(a) The expression in Eq. (1.2.2) is called the general solution of the differential equation. 
(b) The function p(t) = e~4™ is called the integrating factor of the equation. 


Example 1.2.2. Show that for constant coefficient equations the solution formula given in 
Eq. (1.2.2) reduces to Eq. (1.1.4). 


Solution: In the particular case of constant coefficient equations, a primitive, or antideriv- 
ative, for the constant function a is A(t) = at, so 


y(t) =ce™ + ‘| e bdt. 


Since b is constant, the integral in the second term above can be computed explicitly, 
a i be di = ewt(-2 a = ae 
a a 


b 
Therefore, in the case of a, b constants we obtain y(t) = ce“ — — given in Eq. (1.1.4). < 
a 


Proof of Theorem 1.2.1: Write the differential equation with y on one side only, 


y’—ay=b, 
and then multiply the differential equation by a function p, called an integrating factor, 
py’ —apy=pb. (1.2.3) 
The critical step is to choose a function y such that 
—ap=_p. (1.2.4) 


For any function yu solution of Eq. (1.2.4), the differential equation in (1.2.3) has the form 
py +p y= pb. 
But the left-hand side is a total derivative of a product of two functions, 
/ 
(uy) = pd. (1.2.5) 
This is the property we want in an integrating factor, w. We want to find a function yz such 
that the left-hand side of the differential equation for y can be written as a total derivative, 


just as in Eq. (1.2.5). We need to find just one of such functions p. So we go back to 
Eq. (1.2.4), the differential equation for 41, which is simple to solve, 


pl =-ap E-_a In(\ul)’ =-a => In(yl) =-A+eo, 
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where A = / adt, a primitive or antiderivative of a, and c is an arbitrary constant. Com- 
puting the exponential of both sides we get 


“As b= ae. Cc, = te. 


b= te e 


Since c, is a constant which will cancel out from Eq. (1.2.3) anyway, we choose the integration 
constant co = 0, hence c, = 1. The integrating factor is then 


p(t) =e 4), 
This function is an integrating factor, because if we start again at Eq. (1.2.3), we get 
e4y -ae4y=e4b 3S ee Ay + (e~4)'y =e “4p, 


where we used the main property of the integrating factor, -ae~4 = (ea). Now the 
product rule for derivatives implies that the left-hand side above is a total derivative, 


(e-4 y). =e“. 


Integrating on both sides we get 
(e4 y) = [eAvate => (e4 y) ~ [evar =C¢. 


The function W(t,y) = (e-4y) — fe~“bdt is called a potential function of the differen- 
tial equation. The solution of the differential equation can be computed form the second 
equation above, w = c, and the result is 


y(t) = ce* + eAM per b(t) dt. 


This establishes the Theorem. 


Example 1.2.3. Find all solutions y to the differential equation 


3 
PSone é >, 


Solution: Rewrite the equation with y on only one side, 


3 
Pa ype? 
=u 


Multiply the differential equation by a function 4, which we determine later, 


w()(y'—2y) =P ult) > nly —Sal)y =# ule). 


t 
We need to choose a positive function yw having the following property, 
3 3 y(t) 3 ' 
—-p(t)=yW(t) => --= --=/(l 
= y(t) = p'(6) aa = = (In(|)) 


Integrating, 


3 = 
In(\al) = — fF dt = 3 In(le) +4 =In(lt1"*) + 6 => p= (£e%) mi), 


so we get ps = (+e?) |t|~3. We need only one integrating factor, so we choose pp = t~?. We 
now go back to the differential equation for y and we multiply it by this integrating factor, 


3 
t-3(y' -y) =¢t3P =a ¢3y/-—3t+*y=#. 
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3 


i 
Using that —3t~* = (t73)' and #? = (5) , we get 
By! et rie 
t Sy + (tJ y= (5) > (ry) = (=) => (e3y- 5) =0, 


3 
This last equation is a total derivative of a potential function y(t, y) = #7? y— 3" Since the 


equation is a total derivative, this confirms that we got a correct integrating factor. Now 
we need to integrate the total derivative, which is simple to do, 


4 $3 P a . 6 
t3y—-—=¢ => t3ys=c+. => ylt)=ct?+-—, 
y 3 y 3 y(t) 3 
where c is an arbitrary constant. <J 


Example 1.2.4. Find all solutions of ty’ = —2y + 4t?, with t > 0. 
Solution: Rewrite the equation as 


/ 


y= -5 y+4t & a(t)=—-, b(t) =4t. (1.2.6) 
Rewrite again, 
y + : y = At. 
Multiply by a function p, 
py! + aay = pt. 
Choose py solution of 


2 ya > In(lul=2 > nl) =2mn(le) =m) > w= 42 


We choose pp = t?. Multiply the differential equation by this pz, 
Py +%y=44? => (Py) =4e°. 
If we write the right-hand side also as a derivative, 
(y= @) = (Py-#)' =0. 
So a potential function is ~(t, y(t)) = t? y(t) — t*. Integrating on both sides we obtain 


Cc 
Py-t=ac = Py=c+t* 3s yt) = ate. 
J 


1.2.3. The Initial Value Problem. We now generalize Theorem 1.1.4—initial value 
problems have unique solutions—from constant coefficients to variable coefficients equations. 
We start introducing the initial value problem for a variable coefficients equation—a simple 
generalization of Def. 1.1.3. 


Definition 1.2.2. The initial value problem (IVP) is to find all solutions y of 
y =a(t)y + W(t), (1.2.7) 
that satisfy the initial condition 


Clty) = Hp (1.2.8) 
where a,b are given functions and to, Yo are given constants. 
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Remark: The Equation (1.2.8) is the initial condition of the problem. 
Although the differential equation in (1.2.7) has infinitely many solutions, the associated 
initial value problem has a unique solution. 
Theorem 1.2.3 (Variable coefficients IVP). Given continuous functions a,b, with domain 
(t1,t2), and constants to € (ti, t2) and yo € R, the initial value problem 
yi =alt)y+(t), — y(to) = Yo, (1.2.9) 
has the unique solution y on the domain (t,t), given by 


t 
y(t) = yet + eA) 7 eA) a(s) ds, (1.2.10) 


to 


t 
where the function A(t) = | a(s)ds is a particular antiderivative of function a. 
to 


Remark: In the particular case of a constant coefficient equation, where a,b € R, the 
solution given in Eq. (1.2.10) reduces to the one given in Eq. (1.1.12). Indeed, 


t t b b 
A(t) = / ads = a(t — to), / esto) bdg = —— eat) 4 
to to a a 
Therefore, the solution y can be written as 
y(t) = yp ett") + ealtto) (- en altto) *) 7 (u 4 °) ealt—to) _ 2 
a a a a 


Proof Theorem 1.2.3: Theorem 1.2.1 gives us the general solution of Eq. (1.2.9), 


y(t) = ced) + eA(t) je b(t) dt, cER. 


Let us use the notation K(t) = | e~4 b(t) dt, and then introduce the initial condition 
in (1.2.9), which fixes the constant c, 
Yo = y(to) = ce“) + eAlto) K(¢,), 


So we get the constant c, 
c= ye Al) — K(t,). 
Using this expression in the general solution above, 


y(t) = (vo e Alto) _ K(to)) eA + eAMO K(t) = yp eA Alo) 4 AM (K (4) — K(to)). 


Let us introduce the particular primitives A(t) = A(t) — A(t.) and K(t) = K(t) — K(to), 
which vanish at to, that is, 


A(t) = | ‘a(s)ds,  K(t) = / * e-Al) B(s) ds. 


to 
Then the solution y of the IVP has the form 


. t 
y(t) = yet + AY | e A(5) b(s) ds 
to 


which is equivalent to 


. t 
y(t) = yo eA + eAO-Alto) | e~ (Als)-Alto)) (5) ds, 


to 
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so we conclude that : 
y(t) =Yo eAlt) + Ae | e Als) b(s) ds. 


to 


Once we rename the particular primitive A simply by A, we establish the Theorem. 


We solve the next Example following the main steps in the proof of Theorem 1.2.3 
above. 


Example 1.2.5. Find the function y solution of the initial value problem 


ty’ + 2y = 4¢?, t > 0, y(1) = 2. 


Solution: In Example 1.2.4 we computed the general solution of the differential equation, 
c 
yWQ= ath, ceER. 
The initial condition implies that 
ll 
2=y)=c+1 => c=1 = y(t) = ate. 


dq 
Example 1.2.6. Find the solution of the problem given in Example 1.2.5, but this time 
using the results of Theorem 1.2.3. 


Solution: We find the solution simply by using Eq. (1.2.10). First, find the integrating 
factor function js as follows: 


A(t) = i : ds = —2[In(t) —In(1)] = —2In(t) = A(t) = In(t7”). 


8 
The integrating factor is u(t) = e~4™, that is, 

p(t) =e ED =O) st) = #7. 
Note that Eq. (1.2.10) contains e4 = 1/p(t). Then, compute the solution as follows, 


1 t 
y(t) = (2 +/ s’ 4s ds) 
a 1 - 


= 5+ 5 | 4s°ds 
eB’ Pf, 
2. lia 
—gtpe—) 
25 J I 


<q 


1.2.4. The Bernoulli Equation. In 1696 Jacob Bernoulli solved what is now known 
as the Bernoulli differential equation. This is a first order nonlinear differential equation. 
The following year Leibniz solved this equation by transforming it into a linear equation. 
We now explain Leibniz’s idea in more detail. 


Definition 1.2.4. The Bernoulli equation is 
y' = p(t)y+a(t)y”. (1.2.11) 


where p, q are given functions andne€ R. 
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Remarks: 


(a) For n £ 0,1 the equation is nonlinear. 
(b) If n = 2 we get the logistic equation, (we'll study it in a later chapter), 


= 1-2). 
ing ee 


(c) This is not the Bernoulli equation from fluid dynamics. 


The Bernoulli equation is special in the following sense: it is a nonlinear equation that 
can be transformed into a linear equation. 


Theorem 1.2.5 (Bernoulli). The function y is a solution of the Bernoulli equation 


y=pt)yta(t)y", nl, 
iff the function v = 1/y\"-)) is solution of the linear differential equation 
ol = —(n— 1)p(t)v — (n= alt). 


Remark: This result summarizes Laplace’s idea to solve the Bernoulli equation. To trans- 
form the Bernoulli equation for y, which is nonlinear, into a linear equation for v = 1/ yr), 
One then solves the linear equation for v using the integrating factor method. The last step 
is to transform back to y = (1/v)/("-»). 


Proof of Theorem 1.2.5: Divide the Bernoulli equation by y”, 
y’ _ ptt 
apa, q(t). 


y” a yrt 
Introduce the new unknown v = y~"~) and compute its derivative, 
(@) _ y'@) 
v= fyeO! =-@-ny ty = -2 -2o 
| (n—1) y(t) 


If we substitute v and this last equation into the Bernoulli equation we get 
i: 
U 


(n—1) 
This establishes the Theorem. 


=pt)ut+gt) = v'=—(n—-I1)p(t)v—(n—- lat). 


Example 1.2.7. Find every nonzero solution of the differential equation 
yl =yt2y?. 


Solution: This is a Bernoulli equation for n = 5. Divide the equation by y°, 


/ 
y 1 
ee. 
yy" 
Introduce the function v = 1/y* and its derivative v' = —4(y'/y°), into the differential 
equation above, 
/ 


-T=u+2 => v’=-4vu-8 => v'4+4v=-8. 


The last equation is a linear differential equation for the function v. This equation can be 
solved using the integrating factor method. Multiply the equation by p(t) = e**, then 


8 
(ctv) =-8e% = eMy= “| et +e. 
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We obtain that v = ce “ — 2. Since v = 1/y"*, 
1 
aA 
y 


— 
(c e—4t — 2) ue 


=ce*-2 3 y(t) =+ 


Example 1.2.8. Given any constants do, bo, find every solution of the differential equation 


y =aytby. 


Solution: This is a Bernoulli equation with n = 3. Divide the equation by y°, 


/ 


Y _ 4 
ye pt” 
Introduce the function v = 1/y? and its derivative v’ = —2(y'/y?), into the differential 


equation above, 


/ 
a av+tb = v =—2av—2b) = v' +2a.v = —2bp. 


The last equation is a linear differential equation for v. This equation can be solved using 
the integrating factor method. Multiply the equation by p(t) = e2%%, 


t bo 
(emer) = —2bo e220t = e2%0ty ot e2aot +e 
Ao 


b 
We obtain that v = ce~7* — =. Since v = 1/y?, 
Ao 


1 


b 1 
2 _ = 
y 


ult) =e 8 
‘a 


Qo (c e-2a0t — bo 


—2aot __ 


= ce 


Example 1.2.9. Find every solution of the equation ty’ = 3y +t y!/3. 


Solution: Rewrite the differential equation as 
3 
y! _ eae he 
This is a Bernoulli equation for n = 1/3. Divide the equation by y!/3, 


/ 
3 
YY _ ° y/s ee 


yt/3 + 
Define the new unknown function v = 1/ go. that is, v = y?/ 3, compute is derivative, 
2 y' 
= 


— 3 yi/3? 


and introduce them in the differential equation, 


3 3 2 
50 =e => ear ole 


This is a linear equation for v. Integrate this equation using the integrating factor method. 
To compute the integrating factor we need to find 


A(t) = / 2 dt = 21n(t) = In(t?). 
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Then, the integrating factor is (t) = e~4. In this case we get 
— In(t2 n(t~2 1 
p(t) =e PE) = EE) > u(t) = s- 
Therefore, the equation for v can be written as a total derivative, 
lap De. Bes v2 5\! 
==? = (5-<#) = 
Clare ae 2 9 
The potential function is w(t, v) = v/t? — (2/9)t? and the solution of the differential equation 
is #(t, v(t)) = ¢, that is, 


2 2 2 
og =e = v(t) = # (c+ 54) > u(t) =ct? + oe. 
Once v is known we compute the original unknown y = +v°/?, where the double sign is 


related to taking the square root. We finally obtain 


y(t) =+ (ct? + =¢) o 


Notes. This section corresponds to Boyce-DiPrima [3] Section 2.1, and Simmons [10] 
Section 2.10. The Bernoulli equation is solved in the exercises of section 2.4 in Boyce- 
Diprima, and in the exercises of section 2.10 in Simmons. 
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1.2.5. Exercises. 


1.2.1.- Find all solutions of 
y =Aty. 


1.2.2.- Find the general solution of 
y! =-y a et, 
1.2.3.- Find the solution y to the IVP 
y’ =yt2te*, y(0) =0. 


1.2.4.- Find the solution y to the IVP 


eee pint). v(2) a 


t 
for t > 0. 


1.2.5.- Find all solutions y to the ODE 


a Ay 
(#? + jy 


1.2.6.- Find all solutions y to the ODE 


ty’ +ny=t, 


with n a positive integer. 


1.2.7.- Find the solutions to the IVP 
2ty—y' =0, y(0) =3. 
1.2.8.- Find all solutions of the equation 
y = y— 2sin(t). 


1.2.9.- Find the solution to the initial value 
problem 


ty’ =2y+4t? cos(4t), —) =0. 
1.2.10.- Find all solutions of the equation 
y +ty=ty’. 
1.2.11.- Find all solutions of the equation 
y= —2y + 62 Vy. 

1.2.12.- Find all solutions of the IVP 

3 

y=yts, yO)=1. 

y 

1.2.13.- * Find all solutions of 


y’ =aytby”, 


where a # 0, b, and n are real constants 
with n £ 0,1. 
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1.3. Separable Equations 


1.3.1. Separable Equations. More often than not nonlinear differential equations 


are harder to solve than linear equations. Separable equations are an exception—they can 
be solved just by integrating on both sides of the differential equation. We tried this idea 
to solve linear equations, but it did not work. However, it works for separable equations. 


Definition 1.3.1. A separable differential equation for the function y is 


hy) y’ = g(t), 


where h,g are given functions. 


Remark: A separable differential equation is h(y) y’ = g(y) has the following properties: 


e The left-hand side depends explicitly only on y, so any t dependence is through y. 
e The right-hand side depends only on t. 
e And the left-hand side is of the form (something on y) x y’. 


Example 1.3.1. 


(a) 


t2 


The differential equation y’ = i is separable, since it is equivalent to 


—y? 
(1—y?) y=? oi 


The differential equation y’ + y? cos(2t) = 0 is separable, since it is equivalent to 
g(t) = — cos(2t), 


1 ‘i 
sy =-cos(2t) = 1 
y? A(y) = ae 
7] 
The functions g and h are not uniquely defined; another choice in this example is: 
1 
g(t) = cos(2t), Aly) = are 
The linear differential equation y’ = a(t) y is separable, since it is equivalent to 
g(t) = a(t), 
/ 
~y =alt) => 1 
y h(y) = y 


The equation y’ = e” + cos(t) is not separable. 
The constant coefficient linear differential equation y’ = a) y+ bo is separable, since it 
is equivalent to 
i g(t) = 1, 
/ 

a yt h(y) = ———_.. 

( oY ) (y) (ag y 4 bo) 
The linear equation y’ = a(t) y+ b(t), with a 4 0 and b/a nonconstant, is not separable. 

dq 


From the last two examples above we see that linear differential equations, with a 4 0, 


are separable for b/a constant, and not separable otherwise. Separable differential equations 
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are simple to solve. We just integrate on both sides of the equation. We show this idea in 
the following example. 


Example 1.3.2. Find all solutions y to the differential equation 


/ 


Yy 
oa cos(2t). 


Solution: The differential equation above is separable, with 


g(t) =cos(2t), — -A(y) = we 
Therefore, it can be integrated as follows: 
/ 
a =cos(2t) <= [- = J cost20) dt +c. 


The integral on the right-hand side can be computed explicitly. The integral on the left-hand 
side can be done by substitution. The substitution is 


u = y(t), du=y'(t)dt => [-3 dt = [cost2e) dt +c. 


This notation makes clear that u is the new integation variable, while y(t) are the unknown 
function values we look for. However it is common in the literature to use the same name 
for the variable and the unknown function. We will follow that convention, and we write 
the substitution as 


y = y(t), dy = y'( > /-3 dt = J cost2e) dt +c. 
We hope this is not too confusing. Integrating on both sides above we get 
1 1 
; =e sin(2t) + ¢. 


So, we get the implicit and explicit form of the solution, 


Pose eghe eh Sie 
y(t) 2 ac : we = sin(2t) + 2c 


Remark: Notice the following about the equation and its implicit solution: 
il 
“2B y’ = cos(2t) = hiy)y=9%), Ah) =->G, — g(t) =cos(2t), 


i = 5 sin(2t) = H(y)=G(t), Aly)= G(t) = 5 sin(2t) 


1 
y y’ 
e Here H is an antiderivative of h, that is, H(y) = f h(y) dy. 
e Here G is an antiderivative of g, that is, G(t) = [ g(t) dt. 


Theorem 1.3.2 (Separable Equations). If h,g are continuous, with h #0, then 
h(y) y' = g(t) (1.3.1) 


has infinitely many solutions y satisfying the algebraic equation 
H(y(t)) = Gt) +¢, (1.3.2) 


where c € R is arbitrary, H and G are antiderivatives of h and g. 
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Remark: An antiderivative of h is H(y) = f h(y)dy, while an antiderivative of g is the 
function G(t) = [ g(t) dt. 


Proof of Theorem 1.3.2: Integrate with respect to t on both sides in Eq. (1.3.1), 


hy)y! =9(t) > [row @a=fotatre, 


where c is an arbitrary constant. Introduce on the left-hand side of the second equation 
above the substitution 

y=y(t), dy=y'(t) dt. 
The result of the substitution is 


fruwv@ae= frondy + [niyay= fgar+e. 


To integrate on each side of this equation means to find a function H, primitive of h, and 
a function G, primitive of g. Using this notation we write 


Hy) = [rudy Gt) = f g(a. 
Then the equation above can be written as follows, 
H(y) = G(t) +6, 
which implicitly defines a function y, which depends on t. This establishes the Theorem. 


Example 1.3.3. Find all solutions y to the differential equation 
t? 


/ 


y 


(1.3.3) 


re 
Solution: We write the differential equation in (1.3.3) in the form h(y) y’ = g(t), 
(1 = y) y= i. 
In this example the functions h and g defined in Theorem 1.3.2 are given by 
h(y)=(1—y’), g(t) = #?. 
We now integrate with respect to t on both sides of the differential equation, 


[o-vo)v@a= [Pare 


where c is any constant. The integral on the right-hand side can be computed explicitly. 
The integral on the left-hand side can be done by substitution. The substitution is 


y=10, wa¥Qa = fo-vo)v@a=fa-P)ay 


This substitution on the left-hand side integral above gives, 


y? p 
[a-Pay= [ease S&S y-s>= +0. 


3 3 
The equation above defines a function y, which depends on t. We can write it as 
3 3 
y(t) _¢t 
j= oe 
W-- is 


We have solved the differential equation, since there are no derivatives in the last equation. 
When the solution is given in terms of an algebraic equation, we say that the solution y is 
given in implicit form. <J 
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Definition 1.3.3. A function y is a solution in implicit form of the equation h(y) y’ = g(t) 
iff the function y is solution of the algebraic equation 


H(y(t)) = G(t) +e, 


where H and G are any antiderivatives of h and g. In the case that function H is invertible, 
the solution y above is given in explicit form iff is written as 


y(t) = H~*(G(t) +e). 


In the case that H is not invertible or H~! is difficult to compute, we leave the solution 
y in implicit form. We now solve the same example as in Example 1.3.3, but now we just 
use the result of Theorem 1.3.2. 
Example 1.3.4. Use the formula in Theorem 1.3.2 to find all solutions y to the equation 
t? 
Pia 


/ 


y 


(1.3.4) 


a 

Solution: Theorem 1.3.2 tell us how to obtain the solution y. Writing Eq. (1.3.4) as 
(l-y)y =, 

we see that the functions h, g are given by 


h(y)=1-y*, g(t) =?’ 


Their primitive functions, H and G, respectively, are simple to compute, 


Yy 
hy)=1-y° = Hly)=y-F, 
£3 
j=" => Cee 3 
Then, Theorem 1.3.2 implies that the solution y satisfies the algebraic equation 
3 3 
y(t) _¢t 
t)— =— 1.3.5 
y(t) EP = Ste, (1.3.5) 
where c € R is arbitrary. <J 


Remark: Sometimes it is simpler to remember ideas than formulas. So one can solve a 
separable equation as we did in Example 1.3.3, instead of using the solution formulas, as in 
Example 1.3.4. (Although in the case of separable equations both methods are very close.) 


In the next Example we show that an initial value problem can be solved even when 
the solutions of the differential equation are given in implicit form. 
Example 1.3.5. Find the solution of the initial value problem 
t? 


y = --? y(0) = 1. (1.3.6) 


Solution: From Example 1.3.3 we know that all solutions to the differential equation 
in (1.3.6) are given by 
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where c € Ris arbitrary. This constant c is now fixed with the initial condition in Eq. (1.3.6) 


3(0) 0 1 2 “) -F , 2 
rao te > a cx c 3 > y(t) yd) _ fe 
So we can rewrite the algebraic equation defining the solution functions y as the (time 


dependent) roots of a cubic (in y) polynomial, 
y(t) — 3y(t) +27 +2=0. 


Example 1.3.6. Find the solution of the initial value problem 
y’ + y* cos(2t) = 0, y(0) = 1. (13-7) 
Solution: The differential equation above can be written as 
1 
—~ y! = cos(2t). 
y 
We know, from Example 1.3.2, that the solutions of the differential equation are 
2 
t — 
y(t) = Salat) + De" 
The initial condition implies that 
1 = 0 SS 
WO) = 079 


So, the solution to the IVP is given in explicit form by 
2 
sin(2t) + 2° 


cER. 


c=1. 


y(t) = 


Example 1.3.7. Follow the proof in Theorem 1.3.2 to find all solutions y of the equation 
, 4t-8 
44 y3° 


y 


Solution: The differential equation above is separable, with 
h(y)=4+y", g(t) =4t-#. 


Therefore, it can be integrated as follows: 


(4+y%)y =4t-P © [Urvo)v a= fat-P)dre. 


Again the substitution 
y=y(t), dy=y"(t)dt 
implies that 
y" ie 
[Otway = [at-8)at +e. dy + 7 = 2¢? — ate. 


Calling c, = 4c) we obtain the following implicit form for the solution, 
y*(t) + 16y(t) — 84? + ¢4 = q. 
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Example 1.3.8. Find the solution of the initial value problem below in explicit form, 


a. (1.3.8) 


/ 


Solution: The differential equation above is separable with 


Ay)=1t+y, g(t) =2-¢. 
Their primitives are respectively given by, 


2 
h(tyy=1l+y => Hy) =y+4, 


2 
g(t)=2-t 3 Gt) =2- 5. 


Therefore, the implicit form of all solutions y to the ODE above are given by 


2 2 
y(t) t 
t = 2t-— — 
yt) + > ane 
with c € R. The initial condition in Eq. (1.3.8) fixes the value of constant c, as follows, 
(0 1 3 

y(0) + ¥) <o46 = it 6 = 2a5 
We conclude that the implicit form of the solution y is given by 

2(t Pr. 3 

y(t) 4 vO _ 9 vie ee y’ (t) + 2y(t) + (#? — 4t-— 3) =0. 


The explicit form of the solution can be obtained realizing that y(t) is a root in the quadratic 
polynomial above. The two roots of that polynomial are given by 


y.(t) = Z(29 +/4-4(2-4t-3)]) © y(t) =-1tV-t?2+4t4+4. 


2 


We have obtained two functions y, and y-. However, we know that there is only one solution 
to the initial value problem. We can decide which one is the solution by evaluating them at 
the value t = 0 given in the initial condition. We obtain 


y.(0) =-1- V4=-3. 
Therefore, the solution is y,, that is, the explicit form of the solution is 


y(t) = -1+ #2 + 4t4 4. 


J 


1.3.2. Euler Homogeneous Equations. Sometimes a differential equation is not 
separable but it can be transformed into a separable equation changing the unknown func- 
tion. This is the case for differential equations known as Euler homogenous equations. 


Definition 1.3.4. An Euler homogeneous differential equation has the form 
t 
y(t) = a, 


t 


Remark: 
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(a) Any function F' of t,y that depends only on the quotient y/t is scale invariant. This 
means that F’ does not change when we do the transformation y — cy, t — ct, 


CB) = FG): 


For this reason the differential equations above are also called scale invariant equations. 


(b) Scale invariant functions are a particular case of homogeneous functions of degree n, 
which are functions f satisfying 


f(ct, cy) =c” fly, t). 
Scale invariant functions are the case n = 0. 


(c) An example of an homogeneous function is the energy of a thermodynamical system, 
such as a gas in a bottle. The energy, F, of a fixed amount of gas is a function of the gas 
entropy, S, and the gas volume, V. Such energy is an homogeneous function of degree 
one, 


E(cS,cV) =cE(S,V), for allc ER. 


Example 1.3.9. Show that the functions f; and fo are homogeneous and find their degree, 
Ady =ty +t? +y, — falt,y) =? y? + ty’. 
Solution: The function f; is homogeneous of degree 6, since 
fi(ct, cy) = c4t* cy? + ctc®y® + 343 By? = c8 (tty? + ty® + ty?) = & F(t,y). 


Notice that the sum of the powers of t and y on every term is 6. Analogously, function fo 
is homogeneous degree 4, since 


fo(ct,cy) = PP Cy +eFy® =c1 (Py? + ty’) =c* fa(t.y). 


And the sum of the powers of t and y on every term is 4. J 


Example 1.3.10. Show that the functions below are scale invariant functions, 


+t? +t 2 3 
Alty)=4,  flty)=— a. 


p+ ty? 
Solution: Function f; is scale invariant since 
cy sy 
t == 2 => f(t,y). 
i(ect,y) == 4 = sey) 


The function f2 is scale invariant as well, since 


ere + ct? cy + ct cy? + 3y 7 Ce +Pytty? + y) 


£, = 
Ja(et, cy) c3t3 + ct cy? c3(t3 + ty?) 


_ fo(t,y). 


J 


More often than not, Euler homogeneous differential equations come from a differential 
equation N y’+ M = 0 where both N and M are homogeneous functions of the same degree. 
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Theorem 1.3.5. If the functions N, M, of t,y, are homogeneous of the same degree, then 
the differential equation 


N(t,y) y(t) + M(t, y) = 0 


is Euler homogeneous. 


Proof of Theorem 1.3.5: Rewrite the equation as 


M(t, y) 
/ 2 
y t)=— ? 
O= Ney) 
Mit 
The function f(y, y) = — (ty) is scale invariant, because 
N(t,y) 
M (ct, cy c” M(t, y Mit,y 
f(ct,cy) = AN) __ MW) __MAOY) _ Ft,y), 


N(ct, cy) c” N(t,y) N(t,y) 


where we used that M and N are homogeneous of the same degree n. We now find a function 
F such that the differential equation can be written as 


v-F(0. 


Since M and N are homogeneous degree n, we multiply the differential equation by “1” in 
the form (1/t)”/(1/t)”, as follows, 
M(t,y) (1/t") _ — M((t/t), (y/t)) _ _ M1, (y/*)) 


/ as ? = = — y 
'O=—-Tey UP) ~~ MG GD) ~My 7 Y= FG): 


where 


This establishes the Theorem. 


2 
Example 1.3.11. Show that (t—y) y’ —2y+3¢4 : = 0 is an Euler homogeneous equation. 


Solution: Rewrite the equation in the standard form 


y2 
(¢—y)y = 2y-3t- "= 
So the function f in this case is given by 
y) 
y 
2y- 3-2) 
( 2 t 
(t—y) 


This function is scale invariant, since numerator and denominator are homogeneous of the 
same degree, n = 1 in this case, 


f(t,y) _ 


py? 2 
2cy — 3ct — =.) c(2y — 3t — *) 
f(ct,cy) = a+ = = f(t,y). 

(ct — cy) c(t — y) 
So, the differential equation is Euler homogeneous. We now write the equation in the form 
y’ = F(y/t). Since the numerator and denominator are homogeneous of degree n = 1, we 
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multiply them by “1” in the form (1/t)'/(1/t)', that is 
2 


y 
pe hase CO 
((—y) — (1/t) 
Distribute the factors (1/t) in numerator and denominator, and we get 


po 3) y = F(4), 


(1 — (y/t)) t 
where : 
p(¥) = Cul =3- w/e") 
; (1 (v/t)) 
So, the equation is Euler homogeneous and it is written in the standard form. <l 


Example 1.3.12. Determine whether the equation (1 — y?) y’ = t? is Euler homogeneous. 


Solution: If we write the differential equation in the standard form, y’ = f(t, y), then we 
2 


t 
242 


f (ct, cy) = 1— ys # F(t. ¥), 


hence the equation is not Euler homogeneous. <J 


1.3.3. Solving Euler Homogeneous Equations. In § 1.2 we transformed a Bernoulli 
equation into an equation we knew how to solve, a linear equation. Theorem 1.3.6 trans- 
forms an Euler homogeneous equation into a separable equation, which we know how to 
solve. 


Theorem 1.3.6. The Euler homogeneous equation 


=F) 


for the function y determines a separable equation for v = y/t, given by 


Remark: The original homogeneous equation for the function y is transformed into a sep- 
arable equation for the unknown function v = y/t. One solves for v, in implicit or explicit 
form, and then transforms back to y = tv. 


Proof of Theorem 1.3.6: Introduce the function v = y/t into the differential equation, 
y! = F(v). 
We still need to replace y’ in terms of v. This is done as follows, 
y(t) =tv) => y'()=v(t)+tv'(t). 
Introducing these expressions into the differential equation for y we get 


ri ,_ (F(v)-») v! _l 
v+tu =F(v) = ee ae (Fe)=s) > 


The equation on the far right is separable. This establishes the Theorem. 


+ | 
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+ 3y? 
Example 1.3.13. Find all solutions y of the differential equation y’ = ss 
y 
Solution: The equation is Euler homogeneous, since 
242 2,2 2/42 2 2 2 
ct? + 8c*y c(t? + 3y*) = t*# + 3y 
f(et, cy) ~~ = Ft, y)- 


Act\cy) Qty) ty 
Next we compute the function F’. Since the numerator and denominator are homogeneous 
degree “2” we multiply the right-hand side of the equation by “1” in the form (1/t?)/(1/t), 


ea (7) r+a(Z) 
Qty (=) ; 
t2 


Now we introduce the change of functions v = y/t, 


> y= 


je 3v2 
foo 
Since y = tv, then y/ = v+tv’, which implies 
er = 1+ 3v? = gylles 1+ 3v? 7 1+ 3v? — 2v? _ 1+07 
2u 2vu 2v 2u 
We obtained the separable equation 
te il € + =) 
t 2u 


We rewrite and integrate it, 


2v =, iI Qu, ic 
192” t foe t aie 


The substitution u = 1+ v?(t) implies du = 2v(t) v’(t) dt, so 


d dt 
fi a et ea In(u) =In(t)+eo > uw=eln)teo, 
U 
But u = ele, so denoting c; = e, then u = cit. So, we get 


2 
ltv®=at > 14+(2) =at > yt) =4tVat—1. 


<q 


ibd) yt 1)? 

o 
Solution: This equation is Euler homogeneous when written in terms of the unknown 
u(t) = y(t) + 1 and the variable t. Indeed, u’ = y’, thus we obtain 


yaw Det yy _ tut /_u (*)” 


Example 1.3.14. Find all solutions y of the differential equation y’ = 


p Uu= p => pee 


Therefore, we introduce the new variable v = u/t, which satisfies u = tv and u’ =v+tuv’. 
The differential equation for v is 


/ 
1 
vtty =v4+v? ty’ =v [aaa [atte 
v t 


with c€ R. The substitution w = v(t) implies dw = v’ dt, so 
1 


1 
—2 = . — oe = — 
fe aw = [ Zat+e S w In(lt|)) +e & w me 
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Substituting back v, u and y, we obtain w = v(t) = u(t)/t = [y(t) + 1]/t, so 
y+l _ 1 t 
t — In([t|) +e ri 


Notes. This section corresponds to Boyce-DiPrima [3] Section 2.2. Zill and Wright study 
separable equations in [17] Section 2.2, and Euler homogeneous equations in Section 2.5. 
Zill and Wright organize the material in a nice way, they present first separable equations, 
then linear equations, and then they group Euler homogeneous and Bernoulli equations in 
a section called Solutions by Substitution. Once again, a one page description is given by 
Simmons in [10] in Chapter 2, Section 7. 
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1.3.4. Exercises. 


1.3.1.- Find all solutions y to the ODE 
2 
/ i 


y=. 
y 


Express the solutions in explicit form. 
1.3.2.- Find every solution y of the ODE 
3t? + 4y?y’ —1+y' =0. 


Leave the solution in implicit form. 


1.3.3.- Find the solution y to the IVP 
y=ty’, y(0)=1. 
1.3.4.- Find every solution y of the ODE 


ty+V14+##y' =0. 


1.3.5.- Find every solution y of the Euler 
homogeneous equation 
r_Y +t 
y=: 


1.3.6.- Find all solutions y to the ODE 
We v? + y? 
acer 
1.3.7.- Find the explicit solution to the IVP 
(+2ty)y'=y", y(Q)=1. 


1.3.8.- Prove that if y’ = f(t, y) is an Euler 
homogeneous equation and y;(¢) is a so- 
lution, then y(t) = (1/k) y:(kt) is also a 
solution for every non-zero k € R. 


1.3.9.- * Find the explicit solution of the 
initial value problem 
,__ 4t — 6¢? 


aa y(0) = —3. 
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1.4. Exact Differential Equations 


A differential equation is exact when is a total derivative of a function, called potential 
function. Exact equations are simple to integrate—any potential function must be constant. 
The solutions of the differential equation define level surfaces of a potential function. 

A semi-exact differential equation is an equation that is not exact but it can be trans- 
formed into an exact equation after multiplication by a function, called an integrating fac- 
tor. An integrating factor converts a non-exact equation into an exact equation. Linear 
equations, studied in § 1.1 and § 1.2, are a particular case of semi-exact equations. The 
integrating factor of a linear equation transforms it into a total derivative—hence, an exact 
equation. We now generalize this idea to a class of nonlinear equations. 


1.4.1. Exact Equations. A differential equation is exact if certain parts of the differ- 
ential equation have matching partial derivatives. We use this definition because it is simple 
to check in concrete examples. 


Definition 1.4.1. An exact differential equation for y is 
N(t,y)y' + M(t,y) =0 
where the functions N and M satisfy 
O,N(t, y) = OyM(t, y) 


Remark: The functions NV, M depend on t, y, and we use the notation for partial derivatives 


ON OM 

= Oy M = —. 

Ot’ ¥ Oy 

In the definition above, the letter y has been used both as the unknown function (in the 
first equation), and as an independent variable (in the second equation). We use this dual 


meaning for the letter y throughout this section. 


ON = 


Our first example shows that all separable equations studied in § 1.3 are exact. 


Example 1.4.1. Show whether a separable equation h(y) y’(t) = g(t) is exact or not. 
Solution: If we write the equation as h(y) y’ — g(t) = 0, then 

N(t,y)=h(y) => OAN(t,y) =0, 

M(t,y)=g9(t) = OM(t,y) =0, " 
hence every separable equation is exact. <l 


The next example shows that linear equations, written as in § 1.2, are not exact. 


Example 1.4.2. Show whether the linear differential equation below is exact or not, 
y'(t) = a(t) y(t) + b(t), a(t) #0. 
Solution: We first find the functions N and M rewriting the equation as follows, 
y +a(t)y—bt)=0 = N(t,y)=1, M(t,y) =—alt)y—d(t). 
Let us check whether the equation is exact or not, 
N(t,y) =1 => Od,N(t,y) =0, 


M(t,y) = —a(t\y-() > eo i a 
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So, the differential equation is not exact. <J 


The following examples show that there are exact equations which are not separable. 


Example 1.4.3. Show whether the differential equation below is exact or not, 


Qty y! + 2t+y? =0. 


Solution: We first identify the functions N and M. This is simple in this case, since 
(2ty)y’ + (é+y?)=0 => N(é,y)=2ty, M(t,y) =2¢+y?. 
The equation is indeed exact, since 
N(i,y) = 2ty => ON(t,y) = 2y, 
2 => ON(t,y) =M(t,y). 
M(t,y)=2t+y" = dM(t,y) = 2y, 


Therefore, the differential equation is exact. <J 


Example 1.4.4. Show whether the differential equation below is exact or not, 


sin(t) y’ + t?e¥ y’ — y’ = —ycos(t) — 2te¥. 


Solution: We first identify the functions N and M by rewriting the equation as follows, 
(sin(t) + t?e” — 1) y' + (ycos(t) + 2te”) = 0 


we can see that 


N(t, y) = sin(t) + t?e¥ —1 > O.N(t,y) = cos(t) + 2te’, 
M(t, y) = ycos(t) + 2te” > OyM (E, y) = cos(t) + 2te®. 
Therefore, 0,N(t,y) = 0,M(t,y), and the equation is exact. J 


1.4.2. Solving Exact Equations. Exact differential equations can be rewritten as a 
total derivative of a function, called a potential function. Once they are written in such way 
they are simple to solve. 


Theorem 1.4.2 (Exact Equations). If the differential equation 
N(t,y)y' + M(t, y) =0 (1.4.1) 


is exact, then it can be written as 


i 


iN=0 
7 (t,y(t)) = 0, 
where w is called a potential function and satisfies 

N=0,%, M = 0p. (1.4.2) 


Therefore, the solutions of the exact equation are given in implicit form as 


w(t, y(t)) =e, ceER. 


Remark: The condition 0,N = 0,M is equivalent to the existence of a potential function— 
result proven by Henri Poincaré around 1880. 


38 1. FIRST ORDER EQUATIONS 


Theorem 1.4.3 (Poincaré). Continuously differentiable functions N, M, ont, y, satisfy 


iff there is a twice continuously differentiable function Ww, depending on t, y such that 


Remarks: 


(a) A differential equation defines the functions N and M. The exact condition in (1.4.3) 
is equivalent to the existence of 7), related to N and M through Eq. (1.4.4). 

(b) If we recall the definition of the gradient of a function of two variables, Vy) = (O,w, Oy~), 
then the equations in (1.4.4) say that Vw = (M,N). 


Proof of Theorem 1.4.3: 
(=>) It is not given. See [9]. 
(<) We assume that the potential function w is given and satisfies 
N = dy, M = On. 
Recalling that w is twice continuously differentiable, hence 0,0yw = 0,0,, then we have 
ON = Oy = Oy = OyM. 


In our next example we verify that a given function w is a potential function for an 
exact differential equation. We also show that the differential equation can be rewritten as 
a total derivative of this potential function. (In Theorem 1.4.2 we show how to compute 
such potential function from the differential equation, integrating the equations in (1.4.4).) 


Example 1.4.5 (Verification of a Potential). Show that the differential equation 
Qtyy’ + 2t+y? =0. 
is the total derivative of the potential function ~(t, y) = t? + ty?. 


Solution: we use the chain rule to compute the t derivative of the potential function w 
evaluated at the unknown function y, 


<vltsu(t)) = (ay) + (av) 
= (Qty) y' + (2¢+y”). 


So the differential equation is the total derivative of the potential function. To get this 
result we used the partial derivatives 


Oyt) = 2ty = N, On = 2t+y?=M. 
<q 


Exact equations always have a potential function ~, and this function is not difficult 
to compute—we only need to integrate Eq. (1.4.4). Having a potential function of an exact 
equation is essentially the same as solving the differential equation, since the integral curves 
of w define implicit solutions of the differential equation. 

Proof of Theorem 1.4.2: The differential equation in (1.4.1) is exact, then Poincaré 
Theorem implies that there is a potential function ~ such that 


N=0,b, M=dyp. 
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Therefore, the differential equation is given by 
0= Nit,y) y(t) + M(t,y) 
= (yv(t.y)) y' + (vty) 


d 
= —v(t, y(t 
“u(t.u(b)) 
where in the last step we used the chain rule. Recall that the chain rule says 
d dy 
qe eyo) = (dy) ae + (Or). 


So, the differential equation has been rewritten as a total t-derivative of the potential func- 
tion, which is simple to integrate, 


TwUlty()=0 + vlbyl) <6 


where c is an arbitrary constant. This establishes the Theorem. 


Example 1.4.6 (Calculation of a Potential). Find all solutions y to the differential equation 
Qtyy’ + 2t+y? =0. 
Solution: The first step is to verify whether the differential equation is exact. We know 


the answer, the equation is exact, we did this calculation before in Example 1.4.3, but we 


reproduce it here anyway. 
N(i,y) = 2ty => ON(t,y) = 2y, 
2 => OAN(t,y) =M(t,y). 
M(t,y)=2t+y° => Od,M(t,y) =2y. 


Since the equation is exact, Lemma 1.4.3 implies that there exists a potential function 
satisfying the equations 


Ow(t,y) = M(t, y). (1.4.6) 
Let us compute w. Integrate Eq. (1.4.5) in the variable y keeping the variable ¢ constant, 


Oy(t,y) = 2ty => V(ty)= [rtvdy +90, 

where g is a constant of integration on the variable y, so g can only depend on t. We obtain 

w(t, y) = ty” +9(t). (1.4.7) 
Introduce into Eq. (1.4.6) the expression for the function w in Eq. (1.4.7) above, that is, 

y+ 9'(t) = Ov(t,y) = M(t,y) =2t+y? > g(t) =2t 
Integrate in t the last equation above, and choose the integration constant to be zero, 
goeSF, 
We have found that a potential function is given by 
b(t,y) = ty? +t. 
Therefore, Theorem 1.4.2 implies that all solutions y satisfy the implicit equation 
ty? (t) +t? =c, 

for any c € R. The choice g(t) = t? + c only modifies the constant c. <J 
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Remark: An exact equation and its solutions can be pictured on the graph of a potential 
function. This is called a geometrical interpretation of the exact equation. We saw that an 
exact equation N y’+M = 0 can be rewritten as dy)/dt = 0. The solutions of the differential 
equation are functions y such that w(t, y(t)) = c, hence the solutions define level curves of 
the potential function. Given a level curve, the vector r(t) = (t, y(t)), which belongs to the 
ty-plane, points to the level curve, while its derivative r/(t) = (1,y’(t)) is tangent to the 
level curve. Since the gradient vector Vy = (M,N) is perpendicular to the level curve, 


rLlVb & r-Vb=0 & M+Ny' =0. 
We wanted to remark that the differential equation can be thought as the condition r’ L Vw. 


As an example, consider the differential equa- 
tion zn z= (ty) 
2yy’ + 2t =0, 
which is separable, so it is exact. A potential 

function is 

patty’, 
a paraboloid shown in Fig. 2. Solutions y are 
defined by the equation t? + y? = c, which are 
level curves of w for c > 0. The graph of a 
solution is shown on the ty-plane, given by 


y(t) =+Vc-—??. 
As we said above, the vector r(t) = (t, y(t)) 
points to the solution’s graph while its deriv- 
ative r’(t) = (1,y'(t)) is tangent to the level 
cuve. We also know that the gradient vec- 
tor Vw = (2t, 2y) is perpendicular to the level t y(t) =+Ve-@ 
curve. The condition : 


rlVy = -Vy=0, FIGURE 2. Potential w with 
is precisely the differential equation, level curve 7 = c defines a 
1 + 2yy/ =0 solution y on the ty-plane. 


Example 1.4.7 (Calculation of a Potential). Find all solutions y to the equation 
sin(t) y’ + t?e¥ y’ — y' + ycos(t) + 2te¥ — 3t? = 0. 


Solution: The first step is to verify whether the differential equation is exact. 


N(t,y) = sin(t) +t?e¥ —1 = 0: N(t, y) = cos(t) + 2te®, 

M(t, y) = ycos(t) + 2te¥ — 3¢? => O,M(E, y) = cos(t) + 2te?. 
So, the equation is exact. Poincaré Theorem says there is a potential function ~ satisfying 
Oyb(t.y)=N(t,y), Av(t,y) = M(t,y). (1.4.8) 


To compute ~ we integrate on y the equation 0, = N keeping t constant, 


dyv(t,y) =sin(t)+Pe%-1 > vi(ty)= J teint + t?e¥ — 1) dy + g(t) 
where g is a constant of integration on the variable y, so g can only depend on t. We obtain 


v(t, y) = ysin(t) + te — y + g(t). 
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Now introduce the expression above for 7 in the second equation in Eq. (1.4.8), 
ycos(t) + 2te¥ + g'(t) = OwW(t, y) = M(t, y) = ycos(t) + 2te¥ —3t2 => g'(t) = —3#?. 


The solution is g(t) = —t? +c, with co a constant. We choose c = 0, so g(t) = —t?. We 
found g, so we have the complete potential function, 


w(t,y) = ysin(t) + te¥ —y— 0°. 


Theorem 1.4.2 implies that any solution y satisfies the implicit equation 


y(t) sin(t) + t2e¥ — y(t) —#B =c. 


The solution y above cannot be written in explicit form. If we choose the constant c, # 0 
in g(t) = —t? +c, we only modify the constant c above. <J 


Remark: A potential function is also called a conserved quantity. This is a reasonable name, 
since a potential function evaluated at any solution of the differential equation is constant 
along the evolution. This is yet another interpretation of the equation dw/dt = 0, or its 
integral 2(t, y(t)) = c. If we call c = y = (0, y(0)), the value of the potential function at 
the initial conditions, then ~(t, y(t)) = Yo. 

Conserved quantities are important in physics. The energy of a moving particle is a 
famous conserved quantity. In that case the differential equation is Newton’s second law of 
motion, mass times acceleration equals force. One can prove that the energy E of a particle 
with position function y moving under a conservative force is kept constant in time. This 
statement can be expressed by E(t, y(t), y’(t)) = Eo, where E, is the particle’s energy at the 
initial time. 


1.4.3. Semi-Exact Equations. Sometimes a non-exact differential equation can be 
rewritten as an exact equation. One way this could happen is multiplying the differential 
equation by an appropriate function. If the new equation is exact, the multiplicative function 
is called an integrating factor. 


Definition 1.4.4. A semi-exact differential equation is a non-exact equation that can be 
transformed into an exact equation after a multiplication by an integrating factor. 


Example 1.4.8. Show that linear differential equations y’ = a(t) y + b(t) are semi-exact. 


Solution: We first show that linear equations y’ = ay + b with a £ 0 are not exact. If we 
write them as 
y -—ay—-b=0 3S Ny+Me=0 with N=1, M=-—ay-—b. 
Therefore, 
OaN=0, AM=-a => ON Fd,M. 


We now show that linear equations are semi-exact. Let us multiply the linear equation by 
a function 4, which depends only on ft, 


u(t) y’ — a(t) w(t) y — w(t) b(t) = 0, 
where we emphasized that 1, a, b depende only on t. Let us look for a particular function 
p that makes the equation above exact. If we write this equation as N y’ + M = 0, then 


N(t,y) =p, M(t,y) =—apy— pb. 
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We now check the condition for exactness, 
aN=p', dM =—-auy, 
and we get that 
aN = 0,M ! me pw =—ap 
the equation is exact i is an integrating factor. 


Therefore, the linear equation y’ = ay + b is semi-exact, and the function that transforms 
it into an exact equation is p(t) = e~4M, where A(t) = f a(t) dt, which in § 1.2 we called 
it an integrating factor. <J 


Now we generalize this idea to nonlinear differential equations. 


Theorem 1.4.5. If the equation 


Ni(t,y)y' + M(t,y) =0 (1.4.9) 
is not exact, with 0,N A OyM, the function N £0, and where the function h defined as 
O,M —0,N 
j= 1.4.10 
x (1.4.10) 
depends only on t, not on y, then the equation below is exact, 
(e@ N)y' + (e#M) =0, (1.4.11) 


where H is an antiderivative of h, 


Remarks: 


(a) The function p(t) = e# is called an integrating factor. 
(b) Any integrating factor pu is solution of the differential equation 


p'(t) = h(t) pt). 
(c) Multiplication by an integrating factor transforms a non-exact equation 
Ny +M=0 
into an exact equation. 
(UN) y! + (uM) = 0. 

This is exactly what happened with linear equations. 
Verification Proof of Theorem 1.4.5: We need to verify that the equation is exact, 

(e* N)y +(e*M)=0 > Mt,y)=e"ON(t,y), M(t,y) =e% Mity). 
We now check for exactness, and let us recall 0;(e”) = (e”)! = he”, then 

aAN=he¥N+e"%O,N, O,M =e" O,M. 

Let us use the definition of h in the first equation above, 
OyM — 0,N) 

N 


O,N = e# ( N+O,N =e" 9,M =0,M. 
y y 


So the equation is exact. This establishes the Theorem. 
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Constructive Proof of Theorem 1.4.5: The original differential equation 
Ny +M=0 


is not exact because 0:N # 0,M. Now multiply the differential equation by a nonzero 
function p that depends only on f, 


(uN) y! + (uM) = 0. (1.4.12) 


We look for a function 2 such that this new equation is exact. This means that j must 
satisfy the equation 


&y(uN) = d,(4M). 
Recalling that 44 depends only on ¢ and denoting 0; = py’, we get 
wUN+pQnN=pyM => pw N=p(0,M — ON). 
So the differential equation in (1.4.12) is exact iff holds 
r (os _ oN) 
#=(—— — ) # 
The solution p will depend only on ¢ iff the function 
N(t,y) 


depends only on t. If this happens, as assumed in the hypotheses of the theorem, then we 
can solve for ys as follows, 


WO) =HE) we) > wt)=e", HH) = f rat, 
Therefore, the equation below is exact, 


(e# N)y! + (e% M) =0. 


h(t) 


This establishes the Theorem. 


Example 1.4.9. Find all solutions y to the differential equation 
(i? +ty) y + (Bty+y") =0. (1.4.13) 


Solution: We first verify whether this equation is exact: 
N(t,y) =? +ty > ON (t,y) = 2t+y, 
M(t,y) = 3ty + y? = d,M(t,y) = 3t + 2y, 


therefore, the differential equation is not exact. We now verify whether the extra condition 
in Theorem 1.4.5 holds, that is, whether the function in (1.4.10) is y independent; 


: Now) 
_ (8t + 2y) — (2t+ y) 
(? + ty) 
_ (t+y) 
t(t+y) 
1 1 
a Soe A(t) = re 
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So, the function h = (0,M — 0,N)/N is y independent. Therefore, Theorem 1.4.5 implies 
that the non-exact differential equation can be transformed into an exact equation. We need 
to multiply the differential equation by a function yz solution of the equation 


yi(t) = h(t) p(t) m : GOK) + wah=e 


where we have chosen in second equation the integration constant to be zero. Then, multi- 
plying the original differential equation in (1.4.13) by the integrating factor 4 we obtain 


(30? y+ty?) + (P+? y)y' =0. (1.4.14) 
This latter equation is exact, since 
N(t,y) = + t?y => ON (t, y) = 3? + Qty, 
M(t, y) = 3t?y + ty? => Oy M(t, y) = 3t? + Qty, 


so we get the exactness condition 0,N —s a,M . The solution y can be found as we did in the 
previous examples in this Section. That is, we find the potential function w by integrating 
the equations 


dyb(t,y) = N(t,y), (1.4.15) 


w(t, y) = M(t, y). (1.4.16) 


From the first equation above we obtain 


wya=P+Py = vty)= Gs + ty) dy + g(t). 

Integrating on the right hand side above we arrive to 
b(t,y) = ty + sey + g(t). 

Introduce the expression above for ~ in Eq. (1.4.16), 

3t?y + ty? + g(t) = v(t, y) = M(t,y) = 3t?y + ty”, 

g(t) = 0. 
A solution to this last equation is g(t) = 0. So we get a potential function 
w(t,y) =e + ae 
All solutions y to the differential equation in (1.4.13) satisfy the equation 
Py(t) +5 P(u(d)? =e, 

where ¢ € R is arbitrary. <J 


We have seen in Example 1.4.2 that linear differential equations with a 4 0 are not 
exact. In Section 1.2 we found solutions to linear equations using the integrating factor 
method. We multiplied the linear equation by a function that transformed the equation 
into a total derivative. Those calculations are now a particular case of Theorem 1.4.5, as 
we can see it in the following Example. 


Example 1.4.10. Use Theorem 1.4.5 to find all solutions to the linear differential equation 
y =a(t)y + 0(0), a(t) 4 0. (1.4.17) 
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Solution: We first write the linear equation in a way we can identify functions N and M, 


y’ — (a(t) yt b(t) = 0. 

We now verify whether the linear equation is exact or not. Actually, we have seen in 
Example 1.4.3 that this equation is not exact, since 

N(t,y) =1 => O.N(t, y) = 0, 

M(t,y) = —a(t) y — (t) = dyM(t,y) = —a(t). 
But now we can go further, we can check wheteher the condtion in Theorem 1.4.5 holds or 
not. We compute the function 

Oy M(t, y) _ ON (t, y) = —a(t) —9 = a(t) 
N(t,y) 1 


and we see that it is independent of the variable y. Theorem 1.4.5 says that we can transform 
the linear equation into an exact equation. We only need to multiply the linear equation by 
a function py, solution of the equation 


H(t) =-al)alt) = n= 4, a= f a(tat. 


This is the same integrating factor we discovered in Section 1.2. Therefore, the equation 
below is exact, 


e AM y! — (a(t) e“4© y — b(t) e- 4M) = 0. (1.4.18) 
This new version of the linear equation is exact, since 
N(t,y) = e740 > aN (t, y) = —a(t) e~4, 
M(t, y) = —a(t) e~4 y — b(t) eW 4 => O,M (t,y) = —a(t) e7 4, 


Since the linear equation is now exact, the solutions y can be found as we did in the previous 
examples in this Section. We find the potential function w integrating the equations 


dy(t,y) = N(t,y), (1.4.19) 


Ow(t, y) = M(t, y). (1.4.20) 


From the first equation above we obtain 
Eee" = aay) = fe dy + g(t). 


The integral is simple, since e~4 is y independent. We then get 
b(t,y) =e A y + g(t). 
We introduce the expression above for y in Eq. (1.4.16), 
—a(t) eA y + g'(t) = Oev(t,y) = M(t,y) = ~alt) eA y — H(4) 4, 
g' (t) = —b(t) ee“ 4M, 
A solution for function g is then given by 


g(t) = —- / b(t) ee 4© dt. 


Having that function g, we get a potential function 


ot,y) =e AMy - [ eo at. 
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All solutions y to the linear differential equation in (1.4.17) satisfy the equation 


eA) ai h— oo e Al) dt = co, 


where c) € R is arbitrary. This is the implicit form of the solution, but in this case it is 
simple to find the explicit form too, 


y(t) = eA (c + / b(t) eA dt). 


This expression agrees with the one in Theorem 1.2.3, when we studied linear equations. <J 


1.4.4. The Equation for the Inverse Function. Sometimes the equation for a 
function y is neither exact nor semi-exact, but the equation for the inverse function y7! 
might be. We now try to find out when this can happen. To carry out this study it is more 
convenient to change a little bit the notation we have been using so far: 


(a) We change the independent variable name from t to «. Therefore, we write differential 
equations as 


pW 


N(z,y)y¥ +M(e,y)=0, y=y(z), y = ae 


(b) We denote by x(y) the inverse of y(x), that is, 


Gy)=2r & Y(t)=y- 
(c) Recall the identity relating derivatives of a function and its inverse function, 
1 
y' (2) 
Our first result says that for exact equations it makes no difference to solve for y or its 
inverse x. If one equation is exact, so is the other equation. 


x'(y) = 


Theorem 1.4.6. Ny'+M=0 isezact @& Ma2'’+N=0 is exact. 


Remark: We will see that for semi-exact equations there is a difference. 
Proof of Theorem 1.4.6: Write the differential equation of a function y with values y(), 
N(z,y)y+M(a,y)=0 and 0,N =d,M. 


If a solution y is invertible we denote y~'(y) = 2(y), and we have the well-known relation 


Divide the differential equation above by y’ and use the relation above, then we get 
N(x,y) + M(x,y) a’ =0, 


where now y is the independent variable and the unknwon function is 2, with values «(y), 
and the prime means x’ = dx/dy. The condition for this last equation to be exact is 


dyM =a,N, 


which is exactly the same condition for the equation Ny’ + M = 0 to be exact. This 
establishes the Theorem. 
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Remark: Sometimes, in the literature, the equations Ny’ + M=0and N+ M2’ =0 are 
written together as follows, 
Ndy+ Mdz=0. 
This equation deserves two comments: 
(a) We do not use this notation here. That equation makes sense in the framework of 
differential forms, which is beyond the subject of these notes. 
(b) Some people justify the use of that equation outside the framework of differential forms 
by thinking y! = - as real fraction and multiplying N y’+M = 0 by the denominator, 


d 
N= +M=0 => Ndy+Mdzr=0. 


me: i 3 ' : 
Unfortunately, y’ is not a fraction = so the calculation just mentioned has no meaning. 
v 


So, if the equation for y is exact, so is the equation for its inverse x. The same is not 
true for semi-exact equations. If the equation for y is semi-exact, then the equation for 
its inverse x might or might not be semi-exact. The next result states a condition on the 
equation for the inverse function x to be semi-exact. This condition is not equal to the 
condition on the equation for the function y to be semi-exact. Compare Theorems 1.4.5 and 
1.4.7. 


Theorem 1.4.7. If the equation 


Maz’ +N=0 
is not exact, with OyM # 0,N, the function M 40, and where the function € defined as 
Fe — (yM — 0,N) 
M 


depends only on y, not on x, then the equation below is exact, 
(e°M)a’ + (e“N) =0 


where L is an antiderivative of @, 


L(y) = ; &y) dy. 


Remarks: 


(a) The function p(y) = e*™) is called an integrating factor. 
(b) Any integrating factor pu is solution of the differential equation 


u(y) = ey) wy). 
(c) Multiplication by an integrating factor transforms a non-exact equation 
Ma'+N=0 
into an exact equation. 
(uM) a’ + (uN) = 0. 

Verification Proof of Theorem 1.4.7: We need to verify that the equation is exact, 

(e& M)2'+(eXN)=0 => M(a,y)=e*™ M(z,y), N(a,y) =e*™ N(z,y). 
We now check for exactness, and let us recall 0,(e”) = (e”)! = fe”, then 


QM =le’M+e* QM, O,N =e" A,N. 
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Let us use the definition of @ in the first equation above, 


(d,M — dN) 
M 


OyM = e% (— M +0,M) =e! 0,N =0,N. 


So the equation is exact. This establishes the Theorem. 


Constructive Proof of Theorem 1.4.7: The original differential equation 
Mz'+N=0 
is not exact because 0,M # 0,N. Now multiply the differential equation by a nonzero 
function y that depends only on y, 
(uM) x' + (uN) = 0. 
We look for a function 4 such that this new equation is exact. This means that j must 
satisfy the equation 
Oy(uM) = 02(UN). 
Recalling that 4 depends only on y and denoting 0,0 = pw’, we get 
wM+pM=p0,N => pM=-p(0,M —O,N). 
So the differential equation (uM) 2’ + (uN) = 0 is exact iff holds 
j (Ge — 0,N ) 
b>=—\— yp 
M 
The solution jz will depend only on y iff the function 
Oy M(x, y) — O2N(a,y 
M(x, y) 
depends only on y. If this happens, as assumed in the hypotheses of the theorem, then we 
can solve for ps as follows, 


Therefore, the equation below is exact, 
(ec M) a’ +(e” N) =0. 


This establishes the Theorem. 


Example 1.4.11. Find all solutions to the differential equation 
(52 e~¥ + 2cos(3z)) y’ + (5e~¥ — 3sin(3z)) = 0. 


Solution: We first check if the equation is exact for the unknown function y, which depends 
on the variable x. If we write the equation as Ny’ + M =0, with y’ = dy/dz, then 


N(a,y) = 5xe-¥ + 2cos(3z) > O,N (x,y) =5e 4 — 6sin(3z), 
M(a,y) =5e ¥ — 3sin(3z) > OyM (x,y) =—5e¥. 
Since 0;N # O,M, the equation is not exact. Let us check if there exists an integrating 
factor uw that depends only on x. Following Theorem 1.4.5 we study the function 


— (0,M — 0,N) _ —10e7¥ + 6sin(3z2) 
N 5xe—¥ + 2cos(3x) ’ 
which is a function of both x and y and cannot be simplified into a function of x alone. 
Hence an integrating factor cannot be function of only x. 


Let us now consider the equation for the inverse function x, which depends on the 
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variable y. The equation is M x’ + N = 0, with x’ = dx/dy, where M and N are the same 
as before, 
M(a,y) =5e ¥ — 3sin(3z) N(a,y) = 5xve-¥ + 2cos(3z). 
We know from Theorem 1.4.6 that this equation is not exact. Both the equation for y and 
equation for its inverse x must satisfy the same condition to be exact. The condition is 
0,N = 0,M, but we have seen that this is not true for the equation in this example. The 
last thing we can do is to check if the equation for the inverse function x has an integrating 
factor 4 that depends only on y. Following Theorem 1.4.7 we study the function 
0, M — 0,N —10e7-" + 6sin(3z 
l= (0y oe ( - ( Ly => ey) =2. 
M (5e-¥ — 3sin(32)) 
The function above does not depend on x, so we can solve the differential equation for pu(y), 
wy) =ey) wy) => wy) = 2H(y) => wy) = Mo e*. 
Since is an integrating factor, we can choose [io = 1, hence p(y) = e?¥. If we multiply the 
equation for x by this integrating factor we get 


eY (5e-¥ — 3sin(3z)) a’ +e (5re¥ + 2cos(3z)) = 0, 
(5e¥ — 3sin(3x) ce”) a’ + (5a e¥ + 2cos(3x) e¥) = 0. 
This equation is exact, because if we write it as Ma’ + N = 0, then 
M(x, y) = 5e¥ — 3sin(3x) e?4 > dy M (x,y) = 5e” — 6sin(3z) e?4, 
N(x, y) = 52 e¥ + 2cos(3x) e¥ > OzN (x,y) = 5e¥ — 6 sin(3x) e?4, 


that is a,M = 0,N. Since the equation is exact, we find a potential function yw from 
a,p=M, Ow=N. 
Integrating on the variable x the equation 0, = M we get 
w(a,y) = 5a e¥ + cos(3zx) e7” + g(y). 
Introducing this expression for w into the equation Oyy = N we get 
5a eY + 2cos(3a) e?¥ + g'(y) = Oy = N = 5a e¥ + 2cos(3a) €74, 
hence g’(y) = 0, so we choose g = 0. A potential function for the equation for x is 
w(x, y) = 5a e¥ + cos(3z) 4. 
The solutions x of the differential equation are given by 
5 a(y) e¥ + cos(3 x2(y)) e4 =. 


Once we have the solution for the inverse function x we can find the solution for the original 
unknown y, which are given by 


5a e¥ + cos(3x) e? 4 = 


J 


Notes. Exact differential equations are studied in Boyce-DiPrima [3], Section 2.6, and in 
most differential equation textbooks. 
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1.4.5. Exercises. 


1.4.1.- Consider the equation 
(1+t?)y’ =—2ty. 


(a) Determine whether the differential 
equation is exact. 

(b) Find every solution of the equation 
above. 


1.4.2.- Consider the equation 


tcos(y) y’ — 2yy’ = —t — sin(y). 

(a) Determine whether the differential 
equation is exact. 

(b) Find every solution of the equation 
above. 


1.4.3.- Consider the equation 


,_ —2-y ety 
= —2y+tety’ 
(a) Determine whether the differential 
equation is exact. 
(b) Find every solution of the equation 


above. 


1.4.4.- Consider the equation 


(6x” — vy) + (—a* + ay”)y’ = 0, 


with initial condition y(0) = 1. 


(a) Find an integrating factor yu that 
converts the equation above into an 
exact equation. 

(b) Find an implicit expression for the 
solution y of the IVP. 


EQUATIONS 


1.4.5.- Consider the equation 


(2xy { +) y 4 Ary” = 0, 


with initial condition y(0) = —2. 


(a) 


(b) 
(c) 


Find an integrating factor uw that 
converts the equation above into an 
exact equation. 

Find an implicit expression for the 
solution y of the IVP. 

Find the explicit expression for the 
solution y of the IVP. 


1.4.6.- Consider the equation 


(c) 


—3x e774 + sin(5x)) y’ 
Bert 5cos(5a)) = 0. 


Is this equation for y exact? If not, 
does this equation have an integrat- 
ing factor depending on x? 

Is the equation for x = y~ exact? 
If not, does this equation have an 
integrating factor depending on y? 
Find an implicit expression for all 
solutions y of the differential equa- 
tion above. 


1 


1.4.7.- * Find the solution to the equation 
Qt? yt+2t7y? +14 (#2 +20? y+2ty) y’ = 0, 


with initial condition 


y(1) = 2. 
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1.5. Applications of Linear Equations 


Different physical systems may be described by the same mathematical structure. The 
radioactive decay of a substance, the cooling of a material, or the salt concentration on 
a water tank can be described with linear differential equations. A radioactive substance 
decays at a rate proportional to the substance amount at the time. Something similar 
happens to the temperature of a cooling body. Linear, constant coefficients, differential 
equations describe these two situations. The salt concentration inside a water tank changes 
in the case that salty water is allowed in and out of the tank. This situation is described 
with a linear variable coefficients differential equation. 


1.5.1. Exponential Decay. An example of exponential decay is the radioactive decay 
of certain substances, such as Uranium-235, Radium-226, Radon-222, Polonium-218, Lead- 
214, Cobalt-60, Carbon-14, etc. These nuclei break into several smaller nuclei and radiation. 
The radioactive decay of a single nucleus cannot be predicted, but the decay of a large 
number can. The rate of change in the amount of a radioactive substance in a sample is 
proportional to the negative of that amount. 

Definition 1.5.1. The exponential decay equation for N with decay constant k > 0 is 


N=—-EN. 


Remark: The equation N’ =k N, with k > 0 is called the exponential growth equation. 
g 


We have seen in § 1.1 how to solve this equation. But we review it here one more time. 


Theorem 1.5.2 (Exponential Decay). The solution N of the exponential decay equation 
N'=-—EN and intial condition N(0) = No is 


Nia”. 


Proof of Theorem 1.5.2: The differential equation above is both linear and separable. 


We choose to solve it using the integrating factor method. The integrating factor is e”, 


(Ni+kN)e*=0 => (eMN)'=0 = eN=c, cER. 
The initial condition No = N(0) =c, so the solution of the initial value problem is 
N(t) =Noe—™. 


This establishes the Theorem. 


Remark: Radioactive materials are often characterized not by their decay constant k but 
by their half-life 7. This is a time it takes for half the material to decay. 


Definition 1.5.3. The half-life of a radioactive substance is the time T such that 


There is a simple relation between the material constant and the material half-life. 


Theorem 1.5.4. A radioactive material constant k and half-life rT are related by the equation 


kr = In(2), 
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Proof of Theorem 1.5.4: We know that the amount of a radioactive material as function 
of time is given by 


NijaNe™ 
Then, the definition of half-life implies, 
N 1 
a =Ne*" => -kr= in(5) => kr=In(2). 


This establishes the Theorem. 


Remark: A radioactive material, N, can be expressed in terms of the half-life, 


gi-t/7)] 


N(t)=Noe-/ 2) => Nt) = Nel = N(t) = No2-*/". 


From this last expression is clear that for t = 7 we get N(r) = N)/2. 


1.5.2. Carbon-14 Dating. Carbon-14 is a radioactive isotope of Carbon-12. An atom 
is an isotope of another atom if their nuclei have the same number of protons but different 
number of neutrons. The Carbon atom has 6 protons. The stable Carbon atom has also 6 
neutrons, so it is called Carbon-12. Carbon-13 is another stable isotope of Carbon having 7 
neutrons. Carbon-14 has 8 neutrons and it happens to be radioactive with half-life 7 = 5730 
years. The Carbon on Earth is made up of 99% of Carbon-12 and almost 1% of Carbon-13. 
Carbon-14 is very rare, in the atmosphere there is 1 Carbon-14 atom per 10!* Carbon-12 
atoms. 

Carbon-14 is being constantly created in the upper atmosphere—by collisions of Carbon- 
12 with outer space radiation—in such a way that the proportion of Carbon-14 and Carbon- 
12 in the atmosphere is constant in time. The Carbon atoms are accumulated by living 
organisms in that same proportion. When the organism dies, the amount of Carbon-14 in the 
dead body decays while the amount of Carbon-12 remains constant. The proportion between 
radioactive over normal Carbon isotopes in the dead body decays in time. Therefore, one 
can measure this proportion in old remains and then find out how old are such remains—this 
is called Carbon-14 dating. 


Example 1.5.1. Bone remains in an ancient excavation site contain only 14% of the 
Carbon-14 found in living animals today. Estimate how old are the bone remains. Use 
that the half-life of the Carbon-14 is tT = 5730 years. 


Solution: Suppose that t = 0 is set at the time when the organism dies. If at the present 
time t, the remains contain 14% of the original amount, that means 
14 
N(t,) = T00 N(0O). 
Since Carbon-14 is a radioactive substance with half-life 7, the amount of Carbon-14 decays 
in time as follows, 
N(i) = N(0)2-*/", 


where T = 5730 years is the Carbon-14 half-life. Therefore, 


14 t 
—t1/T = pace = — 
2 100 - log,(14/100) =  t, =7log,(100/14). 
We obtain that t; = 16,253 years. The organism died more that 16,000 years ago. <J 


Solution: (Using the decay constant k.) We write the solution of the radioactive decay 
equation as 
N(t) = N(0)e™, kr = In(2). 
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Write the condition for ¢,, to be 14 % of the original Carbon-14, as follows, 


14 14 14 
Ne = NO) = eH =7 = -kt, =In(—) 
Oe i007) ° 100 o> tog 
1 100 : : ‘ , 
so, ty = k in(=-). Recalling the expression for & in terms of 7, that is kr = In(2), we get 
l 
f= 7 in(00/14) 
In(2) 
We get t, = 16,253 years, which is the same result as above, since 
In(100/14 
nation 


In(2) — 
dq 


1.5.3. Newton’s Cooling Law. In 1701 Newton published, anonymously, the result 
of his home made experiments done fifteen years earlier. He focused on the time evolution 
of the temperature of objects that rest in a medium with constant temperature. He found 
that the difference between the temperatues of an object and the constant temperature of 
a medium varies geometrically towards zero as time varies arithmetically. This was his way 
of saying that the difference of temperatures, AT’, depends on time as 


(AT)(t) = (AT) e~/”, 


for some initial temperature difference (AT), and some time scale 7. Although this is called 
a “Cooling Law”, it also describes objects that warm up. When (AT), > 0, the object is 
cooling down, but when (AT), < 0, the object is warming up. 

Newton knew pretty well that the function AT above is solution of a very particular 
differential equation. But he chose to put more emphasis in the solution rather than in the 
equation. Nowadays people think that differential equations are more fundamental than 
their solutions, so we define Newton’s cooling law as follows. 


Definition 1.5.5. The Newton cooling law says that the temperature T at a time t of a 
material placed in a surrounding medium kept at a constant temperature T, satisfies 

(AT) =—k(AT), 
with AT(t) = T(t)—Ts, andk > 0, constant, characterizing the material thermal properties. 
Remark: Newton’s cooling law for AT is the same as the radioactive decay equation. 


But now the initial temperature difference, (AT')(0) = T(0) — T;, can be either positive or 
negative. 


Theorem 1.5.6. The solution of Newton’s cooling law equation (AT)’ = —k(AT) with 
initial data T(0) = Tp is 
T(t) =(%—T.)e™ +73. 


Proof of Theorem 1.5.6: Newton’s cooling law is a first order linear equation, which we 
solved in § 1.1. The general solution is 


(AT)\(t)=ce* => T(t)=ce"+T,, ceER, 
where we used that (AT)(t) = T(t) — T;. The initial condition implies 
Ty=T(0)=c+T, > c=h-T; > TtH=(h-T.)e"4+T,. 
This establishes the Theorem. 
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Example 1.5.2. A cup with water at 45 C is placed in the cooler held at 5 C. If after 2 
minutes the water temperature is 25 C, when will the water temperature be 15 C? 


Solution: We know that the solution of the Newton cooling law equation is 
Tha = The" 27, 
and we also know that in this case we have 
Ty = 45, T= 5, T(2) = 25. 


In this example we need to find t, such that T(t,) = 15. In order to find that t, we first 
need to find the constant k, 


T(t) =(45-5)e "45 => T(t)=40e°" 45. 
Now use the fact that T(2) = 25 C, that is, 


1 
20=T(2)=40e7* => In(i/2)=-2k => k= 5 In(2). 
Having the constant k we can now go on and find the time ¢, such that T(¢,) = 15 C. 


T(t) =40e%° "V2 +5 s+ 10=40e%™2) S&S 424. 2 

1.5.4. Mixing Problems. We study the system pictured in Fig. 3. A tank has a salt 
mass Q(t) dissolved in a volume V(t) of water at a time t. Water is pouring into the tank 
at a rate r;(t) with a salt concentration q;(t). Water is also leaving the tank at a rate ro(t) 
with a salt concentration q(t). Recall that a water rate r means water volume per unit 
time, and a salt concentration g means salt mass per unit volume. 


We assume that the salt entering in the tank 
gets instantaneously mixed. As a consequence 
the salt concentration in the tank is homoge- 
neous at every time. This property simplifies 
the mathematical model describing the salt in 
the tank. 

Before stating the problem we want to solve, 
we review the physical units of the main fields 
involved in it. Denote by [r;] the units of the 
quantity r;. Then we have 


Tank 
Instantaneously mixed 


V(t) Q@) Tos do(t) 


ye _ Volume iii Mass 
a) = Pal = Tine 2 A et Sine’ FiGuRE 3. Description of a 
[V] = Volume, [Q] = Mass. water tank problem. 


Definition 1.5.7. A Mixing Problem refers to water coming into a tank at a rate r; with 
salt concentration q;, and going out the tank at a rate ro and salt concentration qo, so that 
the water volume V and the total amount of salt Q, which is instantaneously mixed, in the 
tank satisfy the following equations, 


V'(t) = ri(t) — ro(t), 
Q'(t) = 4 (t) Gi (t) ~ ro(t), qo(t), i 
qo(t) = a (1.5.3) 
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The first and second equations above are just the mass conservation of water and salt, 
respectively. Water volume and mass are proportional, so both are conserved, and we 
chose the volume to write down this conservation in Eq. (1.5.1). This equation is indeed 
a conservation because it says that the water volume variation in time is equal to the 
difference of volume time rates coming in and going out of the tank. Eq. (1.5.2) is the salt 
mass conservation, since the salt mass variation in time is equal to the difference of the salt 
mass time rates coming in and going out of the tank. The product of a water rate r times a 
salt concentration g has units of mass per time and represents the amount of salt entering or 
leaving the tank per unit time. Eq. (1.5.3) is the consequence of the instantaneous mixing 
mechanism in the tank. Since the salt in the tank is well-mixed, the salt concentration is 
homogeneous in the tank, with value Q(t)/V(t). Finally the equations in (1.5.4) say that 
both rates in and out are time independent, hence constants. 


Theorem 1.5.8. The amount of salt in the mixing problem above satisfies the equation 
Q(t) = a(t) Q(t) + b(t), (1.5.5) 
where the coefficients in the equation are given by 
To 


a(t) = ae b(é) = rz ai(¢). (1.5.6) 


Proof of Theorem 1.5.8: The equation for the salt in the tank given in (1.5.5) comes 
from Eqs. (1.5.1)-(1.5.4). We start noting that Eq. (1.5.4) says that the water rates are 
constant. We denote them as r; and rg. This information in Eq. (1.5.1) implies that V’ is 
constant. Then we can easily integrate this equation to obtain 


V(t) = (ri — ro) t+ Vo, (1.5.7) 


where Vo = V(0) is the water volume in the tank at the initial time t = 0. On the other 
hand, Eqs.(1.5.2) and (1.5.3) imply that 


Q(t) = ri ait) — 


Since V(t) is known from Eq. (1.5.7), we get that the function Q must be solution of the 


differential equation 
To 


‘O=ralt t). 
Q(t) = reall) — pe 
This is a linear ODE for the function Q. Indeed, introducing the functions 
io 
t) = b(t) = ri q(t), 
a(t) =e Ul =railt 
the differential equation for Q has the form 
Q(t) = a(t) Q(t) + b(t). 


This establishes the Theorem. 

We could use the formula for the general solution of a linear equation given in Section 1.2 
to write the solution of Eq. (1.5.5) for Q. Such formula covers all cases we are going to 
study in this section. Since we already know that formula, we choose to find solutions in 
particular cases. These cases are given by specific choices of the rate constants r;, To, the 
concentration function q;, and the initial data constants Vj and Q, = Q(0). The study of 
solutions to Eq. (1.5.5) in several particular cases might provide a deeper understanding of 
the physical situation under study than the expression of the solution Q in the general case. 
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Example 1.5.3 (General Case for V(t) = Vo). Consider a mixing problem with equal 
constant water rates r; =r, = 7, with constant incoming concentration q;, and with a given 
initial water volume in the tank Vo. Then, find the solution to the initial value problem 


Q'(t) = a(t) Q(t) +b), Q(0) = Qo, 


where function a and 0 are given in Eq. (1.5.6). Graph the solution function Q for different 
values of the initial condition Qo. 


Solution: The assumption r; = ro = Tr implies that the function a is constant, while the 
assumption that q; is constant implies that the function 6 is also constant too, 


(t) " = ee 

= a =-—-—- —@, 
i (ri —To)t + Vo y 
b(t) = ri a(t) > b(t) = 175 q; = bo. 


Then, we must solve the initial value problem for a constant coefficients linear equation, 
Q'(t) = ao Q(t) + bo, Q(0) = Qo, 


The integrating factor method can be used to find the solution of the initial value problem 
above. The formula for the solution is given in Theorem 1.1.4, 


Ott) = (Qo + *) ert, 


ao 


In our case the we can evaluate the constant bo/ao, and the result is 


b Vv b 
— = (rai) (-—) => -—=aqiho. 
Ao Ts Ao 
Then, the solution @ has the form, 
Q(t) = (Qo — GiVo) e""/ + giVo. (1.5.8) 


The initial amount of salt Q> in the tank can be any non-negative real number. The solution 
behaves differently for different values of Qo. We classify these values in three classes: 


(a) The initial amount of salt in the tank is 
the critical value Qo = qiVo. In this case 
the solution Q remains constant equal to 
this critical value, that is, Q(t) = qiVo. 

(b) The initial amount of salt in the tank is 
bigger than the critical value, Qo > qiVo. 
In this case the salt in the tank Q de- qiVo 
creases exponentially towards the critical 
value. 

(c) The initial amount of salt in the tank is 
smaller than the critical value, Qy < qVo. 

In this case the salt in the tank Q in- t 


creases exponentially towards the critical 
value. FIGURE 4. The function Q 


in (1.5.8) for a few values of 


The graphs of a few solutions in these three 
ened the initial condition Qo. 


classes are plotted in Fig. 4. 
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Example 1.5.4 (Find a particular time, for V(t) = Vo). Consider a mixing problem with 
equal constant water rates r; = ro = r and fresh water is coming into the tank, hence 
q: = 0. Then, find the time ¢, such that the salt concentration in the tank Q(t)/V(t) is 1% 
the initial value. Write that time t, in terms of the rate r and initial water volume VY. 


Solution: The first step to solve this problem is to find the solution Q of the initial value 
problem 
Q'(t) = a(t) Q(t) + Bt), Q(0) = Qo, 
where function a and b are given in Eq. (1.5.6). In this case they are 
To r 
at) =-— = a(t) =- 
b(t) = ri a(t) > b(t) = 0. 


The initial value problem we need to solve is 
r 
Q()=-LaQl), — Q0) = au. 
10) 
From Section 1.1 we know that the solution is given by 


OO =— Que", 


We can now proceed to find the time t,. We first need to find the concentration Q(t)/V(t). 
We already have Q(t) and we now that V(t) = Vo, since rj; = ro. Therefore, 


QW) _ Qt) _ Q ni 


Vit) YW VW 
The condition that defines t, is 


Qs) __ 1 Q 
V(t,) 100 Y 
From these two equations above we conclude that 


1 Qo _ Q(t) _ Qo ert /Vo 
100 ¥% V(t) Vo ; 


The time t, comes from the equation 


1 Ae lvs 1 rt, rt, 
ag In( ie In(100) = —*. 
100° "\ 100 V, Oe aaa 

The final result is given by 
: 
t,; = — In(100) 
Ec 


dq 


Example 1.5.5 (Nonzero q;, for V(t) = Vo). Consider a mixing problem with equal con- 
stant water rates r; = ro = r, with only fresh water in the tank at the initial time, hence 
Qo = 0 and with a given initial volume of water in the tank Vo. Then find the function salt 
in the tank Q if the incoming salt concentration is given by the function 


qi(t) = 2+ sin(2t). 


Solution: We need to find the solution Q to the initial value problem 


Q(t) = a(t) Q(t) + (4), Q(0) = 0, 
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where function a and b are given in Eq. (1.5.6). In this case we have 
(t) 7 > ()=-3 

a So a — fe) 
(Tr; —To)t+Vo iA 

b(t) = ri Gilt) = b(t) = r [2 + sin(2t)]. 


We are changing the sign convention for a) so that ad) > 0. The initial value problem we 
need to solve is 


—Ao, 


Q'(t) = —ao Q(t) + b(t), Q(0) = 0. 


The solution is computed using the integrating factor method and the result is 


t 
Q(t) = ey e°°*b(s) ds, 


where we used that the initial condition is Q) = 0. Recalling the definition of the function 
b we obtain 


Oo)= ey e%*2 + sin(2s)] ds. 


This is the formula for the solution of the problem, we only need to compute the integral 
given in the equation above. This is not straightforward though. We start with the following 
integral found in an integration table, 


ks 


ks a: € 7 
Je sin(Is) ds = aa [ksin(Is) — I cos(Is)], 


where & and / are constants. Therefore, 


[ e%*/2 + sin(2s)| ds = z 
0 


Ao 


t 


2 


t eos 


ags 
© az + 22 


[ao sin(2s) — 2 cos(2s)] 


T 
0 


Ors bss oo 2 
= ate %_1)4 +e [ao sin(2t) — 2cos(2t)] + eae 
With the integral above we can compute the solution Q as follows, 
2 eot 2 
_ ,—-aot aot I . 
Q(t) = o-0'[ = (e00* — 1) + pp [ao sin(24) — 2c08(28)] + Soon], 
recalling that a) = r/Vo. We rewrite expression above as follows, 
2 : 27 cas 1 
=~ + lay wo + gag ge Mosin(24) — 2c0s(24)]. (1.5.9) 


FIGURE 5. The graph of the function Q given in Eq. (1.5.9) for ap = 1. 
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1.5.5. Exercises. 


1.5.1.- A radioactive material decays at 


a rate proportional to the amount 

present. Initially there are 50 mil- 

ligrams of the material present and after 

one hour the material has lost 80% of its 

original mass. 

(a) Find the mass of the material as 
function of time. 

(b) Find the mass of the material after 
four hours. 

(c) Find the half-life of the material. 


1.5.2.- A vessel with liquid at 18 C is placed 


in a cooler held at 3 C, and after 3 min- 
utes the temperature drops to 13 C. 


(a) Find the differential equation satis- 
fied by the temperature T of a liq- 
uid in the cooler at time t = 0. 

(b) Find the function temperature of 
the liquid once it is put in the 
cooler. 

(c) Find the liquid cooling constant. 


1.5.3.- A tank initially contains Vo = 100 


liters of water with Qo = 25 grams of 
salt. The tank is rinsed with fresh wa- 
ter flowing in at a rate of r; = 5 liters 
per minute and leaving the tank at the 
same rate. The water in the tank is well- 
stirred. Find the time such that the 
amount the salt in the tank is Qi = 5 
grams. 


1.5.4.- A tank initially contains Vo = 100 


liters of pure water. Water enters the 
tank at a rate of r; = 2 liters per minute 
with a salt concentration of qi = 3 
grams per liter. The instantaneously 
mixed mixture leaves the tank at the 
same rate it enters the tank. Find the 
salt concentration in the tank at any 
time t > 0. Also find the limiting 
amount of salt in the tank in the limit 
t> co. 


1.5.5.- A tank with a capacity of V,, = 500 


liters originally contains Vo = 200 liters 
of water with Qo = 100 grams of salt 
in solution. Water containing salt with 
concentration of qi = 1 gram per liter 
is poured in at a rate of r; = 3 liters 
per minute. The well-stirred water is 
allowed to pour out the tank at a rate 
of ro = 2 liters per minute. Find the 
salt concentration in the tank at the 
time when the tank is about to overflow. 
Compare this concentration with the 
limiting concentration at infinity time 
if the tank had infinity capacity. 
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1.6. Nonlinear Equations 


Linear differential equations are simpler to solve than nonlinear differential equations. While 
we have an explicit formula for the solutions to all linear equations—Theorem 1.2.3—there 
is no such formula for solutions to every nonlinear equation. It is true that we solved several 
nonlinear equations in §§ 1.2-1.4, and we arrived at different formulas for their solutions, 
but the nonlinear equations we solved are only a tiny part of all nonlinear equations. 

One can give up on the goal of finding a formula for solutions to all nonlinear equations. 
Then, one can focus on proving whether a nonlinear equations has solutions or not. This is 
the path followed to arrive at the Picard-Lindelo6f Theorem. This theorem determines what 
nonlinear equations have solutions, but it provides no formula for them. However, the proof 
of the theorem does provide a way to compute a sequence of approximate solutions to the 
differential equation. The proof ends showing that this sequence converges to a solution of 
the differential equation. 

In this section we first introduce the Picard-Lindelof Theorem and the Picard iteration 
to find approximate solutions. We then compare what we know about solutions to linear 
and to nonlinear differential equations. We finish this section with a brief discussion on 
direction fields. 


1.6.1. The Picard-Lindelof Theorem. We will show that a large class of nonlinear 
differential equations have solutions. First, let us recall the definition of a nonlinear equation. 


Definition 1.6.1. An ordinary differential equation y'(t) = f(t, y(t)) is called nonlinear 
iff the function f is nonlinear in the second argument. 


Example 1.6.1. 
(a) The differential equation 
t? 
/ 
y(t) = aH 
y(t) 
is nonlinear, since the function f(t, y) = t?/y? is nonlinear in the second argument. 
(b) The differential equation 


y(t) = 2ty(t) + In(y(4)) 
is nonlinear, since the function f(t, y) = 2ty+In(y) is nonlinear in the second argument, 


due to the term In(y). 
(c) The differential equation 


/ 
t 
y'(t) — 242 
y(t) 
is linear, since the function f(t, y) = 2¢?y is linear in the second argument. 
dq 


The Picard-Lindel6f Theorem shows that certain nonlinear equations have solutions, 
uniquely determined by appropriate initial data. 


Theorem 1.6.2 (Picard-Lindeléf). Consider the initial value problem 


y'(t)=f(t,y(t)),  — y(to) = Yo (1.6.1) 


If the function f is continuous on the domain Dg = [to — a, to + a] X [yo — a, Yo + a] C R?, 
for some a> 0, and f is Lipschitz continuous on y, that is there exists k > 0 such that 


| f(t, y2) ~~ f(t, y1)| < k lye =i 
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for all (t, y2), (t,y1) € Da, then there exists a positive b <a such that there exists a unique 
solution y, on the domain [t, — b, tp + 6], to the initial value problem in (1.6.1). 


Remark: We prove this theorem rewriting the differential equation as an integral equation 
for the unknown function y. Then we use this integral equation to construct a sequence of 
approximate solutions {y,} to the original initial value problem. Next we show that this 
sequence of approximate solutions has a unique limit as n > oo. We end the proof showing 
that this limit is the only solution of the original initial value problem. This proof follows 
[15] § 1.6 and Zeidler’s [16] § 1.8. It is important to read the review on complete normed 
vector spaces, called Banach spaces, given in these references. 


Proof of Theorem 1.6.2: We start writing the differential equation in 1.6.1 as an integral 
equation, hence we integrate on both sides of that equation with respect to t, 


[ voas= | f(s,y(s))ds => uit) =w+ | f(s, y(s)) ds. (1.6.2) 


We have used the Fundamental Theorem of Calculus on the left-hand side of the first 
equation to get the second equation. And we have introduced the initial condition y(to) = Yo. 
We use this integral form of the original differential equation to construct a sequence of 
functions {yp }°2,. The domain of every function in this sequence is Dg = [to — a, to + al. 
The sequence is defined as follows, 


t 
tnra(t) =a + | feuds wet aia. (1.6.3) 


We see that the first element in the sequence is the constant function determined by the 
initial conditions in (1.6.1). The iteration in (1.6.3) is called the Picard iteration. The 
central idea of the proof is to show that the sequence {y,} is a Cauchy sequence in the 
space C(D,) of uniformly continuous functions in the domain Dy = [to — b, to +6] for a small 
enough b > 0. This function space is a Banach space under the norm 


=ma t)|. 
[el] = max |u(¢)| 
See [15] and references therein for the definition of Cauchy sequences, Banach spaces, and 
the proof that C(D,) with that norm is a Banach space. We now show that the sequence 
{yn} is a Cauchy sequence in that space. Any two consecutive elements in the sequence 
satisfy 


omer —aall = paar] f Fls.amls)pas— J £(s.am-al8)) ds 


tED, 


< max i |f(s,um(s)) — £(5,4m—1(s))| ds 


t 
<k mae lYn(s) — Yn—1(s)| ds 
< kb ||¥n = Yn—1\l- 


Denoting r = kb, we have obtained the inequality 


Yn+a — Ynll <r lyn —Yr-ill =  Wlynti- yall <r" Ilys — yoll- 
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Using the triangle inequality for norms and and the sum of a geometric series one compute 
the following, 


Yn _ Urea = lun — Unt+1 oF Yn+1 — Yn+2 Spee Se Yn+(m—1) — Yn+ml 


UX 


Il — Yn+tl| + [Yn+1 — Yngall +++ + [lYn+(m—1) — Yn+ml 
(r? 4 rPh gp p™™) Na — yo 
re(L4tr4r74---+r™) lly. — yoll 


1-—r™ 
r (———) Ila — voll 


Now choose the positive constant b such that b < min{a,1/k}, hence 0 < r < 1. In this case 
the sequence {y,,} is a Cauchy sequence in the Banach space C(D,), with norm || ||, hence 
converges. Denote the limit by y = limp... yn. This function satisfies the equation 


aes if f(s, y(s)) ds 


which says that y is not only continuous but also differentiable in the interior of Dy, hence 
y is solution of the initial value problem in (1.6.1). The proof of uniqueness of the solution 
follows the same argument used to show that the sequence above is a Cauchy sequence. 
Consider two solutions y and y of the initial value problem above. That means, 


ee | flsy(s)ds, 9() = 40+ | H(s,G(s) ds 


Therefore, their difference satisfies 


IN IN 


IN 


Iv — al = max [Fs u(e))as - wh “F(s.Gle)) as 


< mips [Lt |f( 8, y(s f(s, 9(s))| ds 


te Dy 


< kaa [il ly(s) s)| ds 


tEDp 
< kb|ly — gl). 
Since b is chosen so that r = kb < 1, we got that 
ly-gi<rly-gl, r<1 => |y-gl=9 > y=y. 
This establishes the Theorem. 


Example 1.6.2. Use the proof of Picard-Lindelof’s Theorem to find the solution to 
y=2y+3  y(0)=1. 


Solution: We first transform the differential equation into an integral equation. 


[ voas= [ eus)+s)ds > wy —u00)= f 2uls) +3)a. 


Using the initial condition, y(0) = 1, 


1.6. NONLINEAR EQUATIONS 63 


We now define the sequence of approximate solutions: 


t 
w=GOSL nailt) = 14 f un (s) +3) ds, n>0. 
0 


We now compute the first elements in the sequence. We said yo = 1, now yj; is given by 


t t 
n=0, n()=1+ [ (2y0(s) +3)ds=1+ f 5ds = 1+ 5t. 
0 0 


So yy = 1+ 5t. Now we compute yo, 
t 


t t 

Yy2 = +f (2 y1(s)+3) ds = +f (2(1+5s)+3)ds > y= +f (5+10s) ds = 1+5t+5t?. 
0 0 0 

So we’ve got yo(t) = 1+5t +5t?. Now ys, 


t t 
w=1+ f yn(s)+3)ds—1+4 f (2 +55+5s*) +3) ds 
0 0 


so we have, 


‘ 10 
wa 1+ f (5+ 105-4 105%) ds = 1456+ 50 + 28 
0 


1 
So we obtained y3(t) = 1+ 5t + 5t? + —t?. We now rewrite this expression so we can get 


a power series expansion that can be written in terms of simple functions. The first step is 
done already, to write the powers of t as t”, for n = 1, 2,3, 


5(2 
y3(t) = 1+ 5t? + 5¢? 4 : ) 43 
We now multiply by one each term so we get the factorials n! on each term 
‘a i i 
y3(t) =1+4+5 Ti 5(2) or 5(27) zy 
We then realize that we can rewrite the expression above in terms of power of (2t), that is, 
BQp 8 (a. a (2a 5 (26)? (2¢)8 
t)=14 =14 ( 2t) 4 
ya(#) a 2 # "9 3 gato si 
From this last expressionis simple to guess the n-th approximation 
5 (24)? (24)8 (2t)% 5 (2t)F 
t)=143((2t ! poseg )=14 
A aC aia) 3! NI! 2 », kl 


Recall now that the power series expansion for the exponential 


© fat)k 2 (atyk © fat\* ; 
ay OM yy = yo = (et). 


One last rewriting of the solution and we obtain 


5» 3 
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Remark: The differential equation y’ = 2y +3 is of course linear, so the solution to the 
initial value problem in Example 1.6.2 can be obtained using the methods in Section 1.1, 


3 3 
e*(y-2y)=e "73 => e“Y==5¢ +e > y(t) = ce" — 5; 
and the initial condition implies 


_ = 3 _ 5 5 ap 8 
1=y(0)=c 5 > c=5 => y(t) = 5e 5° 


Example 1.6.3. Use the proof of Picard-Lindel6of’s Theorem to find the solution to 
y =ayt+b y(0) = Go, a,beER. 


Solution: We first transform the differential equation into an integral equation. 


[ veras= [ (aus) +045 > ul)—ul0) = [ (auls) +b)as. 


Using the initial condition, y(0) = G, 


y(t) = Go +f (ay(s) + b) ds. 


We now define the sequence of approximate solutions: 


t 
vo = (0) = bo, nsilt)=to+ | (aun(s) +8)ds, 30. 
0 


We now compute the first elements in the sequence. We said yo = Jo, now yj is given by 
t 
n=0, mlt)=vo+ [ (ayo(s) +8)as 
0 


t 
=iv+ [ (a % + b) ds 
0 
= Yo + (a {Ho + b)t. 
So yi = Go + (aH + b)t. Now we compute yo, 


t 
y2 =a+ f [a y1(s) + b] ds 
0 


= to +f ale ade tat 8) ae 


n . : at? 
= Yo + (af + b)t + (a Go + Dow 


#2 
So we obtained y2(t) = Go + (aio + b)t + (aH + b> A similar calculation gives us ys, 


a2t3 
3! ¢ 
We now rewrite this expression so we can get a power series expansion that can be written 


in terms of simple functions. The first step is done already, to write the powers of t as t”, 
for n = 1, 2,3, 


= , 2 bop 
ys(t) = Go + (aia + 6) + (aia + 6) + (ag +b) 


1 2 3 
gt 


us(t) = Ge + (ago +0) + (aie +0)a% + (aj + b)a Al 
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We already have the factorials n! on each term t”. We now realize we can write the power 
functions as (at)” is we multiply eat term by one, as follows 
+ (abo +b) (at)! , (ao +6) (at)? | (ao +b) (at)? 


= t 
ys(t) = Yo + a 1! a 2! a 3! 


Now we can pull a common factor 
(ob) (lat)! , (at)? (at)! 
ys(t) = Go (G+ ~) ( es Te ) 


From this last expressionis simple to guess the n-th approximation 


w(t) = + (do+ 2) (GE + SP + SP 4.4 GE) 
gone (oe) Se 


Recall now that the power series expansion for the exponential 


at =~ (at)* . (at)* = cee” a 
_ orate as y= ay 


k=0 k=1 k=1 


Notice that the sum in the exponential starts at k = 0, while the sum in y, starts at k = 1. 
Then, the limit n — oo is given by 


y(t) = lim Yn (Et) 


n> 
=o + (+2) 0 SF 
= t+ (n+ @) (@*-), 


We have been able to add the power series and we have the solution written in terms of 
simple functions. One last rewriting of the solution and we obtain 


u(t) = (io +=) et — 2. 


Remark: We reobtained Eq. (1.1.12) in Theorem 1.1.4. 


Example 1.6.4. Use the Picard iteration to find the solution of 

y=5ty, (0) =1. 
Solution: We first transform the differential equation into an integral equation. 

i y' (s) ds = fe 5sy(s)ds => y(t)—y(0)= [ 58 y(s) ds. 
0 

Using the initial condition, y(0) = 1, 

y(t) =14+ i 5s y(s) ds. 
We now define the sequence of approximate solutions 


t 
yo = y(0) = 1, nail) =1+ f 58 yn(s)ds, n>0. 
0 
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We now compute the first four elements in the sequence. The first one is yo = y(0) = 1, the 
second one y; is given by 


t 
n=0, n()=1+ [ Beds = 1420. 
0 
So y; = 1+(5/2)t?. Now we compute yo, 


yo = 14 


5 Be 
=14+-74—¢ 
2 8 
: Bio B74 8s ; : 
So we obtained yo(t) = 1+ 3° + at . A similar calculation gives us y3, 


t 
Ys =1+ [ 58 yo(s) ds 
0 
‘ 5 Se 
=1+ f 5s (1+ 58° +538") ds 


t 2 3 
=1+/ (se? ere Sas 
0 


2 28 
5 5? be 
=1+=?+—t*+4 i 
2 8 236 
; Lee gee ; : : 
So we obtained y3(t) = 14 5t = 4 743 t’. We now rewrite this expression so we can 


get a power series expansion that can be written in terms of simple functions. The first step 
is to write the powers of t as t”, for n = 1, 2,3, 


5 a1, 5 yao, 5 p23 
ys(t) =1+ 510) + 53 (t ) + 54g tt y°. 
Now we multiply by one each term to get the right facctorials, n! on each term, 
5 (t2)! iH 52 (t2)2 | 58 (t2)3 
2 ae Pee eh 
No we realize that the factor 5/2 can be written together with the powers of ¢?, 


y3(t) =1+ 


5 42 5 42)2 5 42)3 
St st st 
agai¢ 224% y 42 us 
1! 2! 3! 
From this last expression is simple to guess the n-th approximation 


9 


N_ (542)\k 
y(t) =1+ >> or 
k=1 


which can be proven by induction. Therefore, 


| =. Ge) 
y(t) = Jim yw) =1+ D7 =. 
k=1 : 
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Recall now that the power series expansion for the exponential 


so we get 


Remark: The differential equation y’ = 5ty is of course separable, so the solution to the 
initial value problem in Example 1.6.4 can be obtained using the methods in Section 1.3, 


/ Be 5 
2 =e In(y) = oe +c => y(t) =e? 
y 


t2 


so we obtain the solution 


Example 1.6.5. Use the Picard iteration to find the solution of 
y =2i*y, (0) = 1. 


Solution: We first transform the differential equation into an integral equation. 


f = f 264 y(s) ds _ (0) = f 284y(s) ds 
[ veoas= f 2s*u(sya = y(t) —y(0) J 28*ulo)at 


0 


Using the initial condition, y(0) = 1, 
t 
y(t) =1 +f 2s* y(s) ds. 
0 
We now define the sequence of approximate solutions: 
t 
wo =O) =1, ynilt)=1+ f 2styn(s)ds, 30. 
0 


We now compute the first four elements in the sequence. The first one is yo = y(0) = 1, the 
second one y; is given by 


t 
2 
n=0, n()=1+ [ astds=1+ 7. 
0 
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So y, = 1+ (2/5)t?. Now we compute yp, 
t 
yo =1 +f 2s* y(s) ds 
0 


t 
=1+/ as (1+ 255) ds 
0) 5 
t 92 
=1+/ (2s* + — s°) ds 
0 5 


2 2° 1 


ee ee ee 
7 - 5 10 
: Qe, 2? 1 sa Lax é : 
So we obtained yo(t) = 1+ st tat A similar calculation gives us y3, 
t 
w=i+ f 2s* yo(s) ds 
0 
t 
2 2” 1 
=] 254 1 a) Sa elO d 
i a a ete ) ds 
t 2 3 
2 2 gi 
=1+ [ =! ae +a5° ) ds 
14 2p Bl yo B11 gs 
5 5 10 52 2 15 
So we obtained y3(t) 142542 1 po e Ah as, We t der t in this last 
In — } nr 
wi y3 29 5323 e now try reorder terms in this las 


expression so we can get a power series expansion we can write in terms of simple functions 
This is what we do: 
22 (45)? 
53 2 
9 t 92 4 2 93 a 3 
ge 2 OV PCY ee) 
5 1! 52 2! 53 3) 
_,, GP), Ger Gey 
1! 2! 3! 
From this last expression is simple to guess the n-th approximation 


p= 


which can be proven by induction. Therefore, 


93 (eP 


GP) 


y(t) = im y(t) = 1+ d, 


Recall now that the power series expansion for the exponential 


at So (at)® _ So (at* 
a a are 
k=0 k=1 


so we get 
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1.6.2. Comparison of Linear and Nonlinear Equations. The main result in § 1.2 
was Theorem 1.2.3, which says that an initial value problem for a linear differential equation 
y =alt)y+(t), —-y(to.) = Yo, 
with a,b continuous functions on (t,,¢,), and constants ty € (t,,t,) and y € R, has the 

unique solution y on (t,,t2) given by 


t 
y(t) = eA (vo +f e A(3) b(s) ds), 


to 
t 
where we introduced the function A(t) = / a(s) ds. 
to 
From the result above we can see that solutions to linear differential equations satisfiy 


the following properties: 


(a) There is an explicit expression for the solutions of a differential equations. 
(b) For every initial condition y) € R there exists a unique solution. 
(c) For every initial condition yo € R the solution y(t) is defined for all (¢:, tz). 


Remark: None of these properties hold for solutions of nonlinear differential equations. 


From the Picard-Lindelof Theorem one can see that solutions to nonlinear differential 
equations satisfy the following properties: 
(i) There is no explicit formula for the solution to every nonlinear differential equation. 
(ii) Solutions to initial value problems for nonlinear equations may be non-unique when 
the function f does not satisfy the Lipschitz condition. 
(iii) The domain of a solution y to a nonlinear initial value problem may change when we 
change the initial data yp. 
The next three examples (1.6.6)-(1.6.8) are particular cases of the statements in (i)-(iii). 
We start with an equation whose solutions cannot be written in explicit form. 


Example 1.6.6. For every constant a1, a2, a3, a4, find all solutions y to the equation 
t2 
y*(t) + aa y3(t) + a3 y?(t) + a2 y(t) +41) 


y(t) = (1.6.4) 


Solution: The nonlinear differential equation above is separable, so we follow § 1.3 to find 
its solutions. First we rewrite the equation as 

(y(t) + agy®(t) + a3 y7(t) + any(t) +m) y(t) = 8. 
Then we integrate on both sides of the equation, 


[vo + ag y3(t) + a3 y?(t) + ag y(t) + a) y (t) dt = fe dt +e. 
Introduce the substitution u = y(t), so du = y’(t) dt, 
[ets acn’ +03? taguta;)du= f Pat +e. 


Integrate the left-hand side with respect to u and the right-hand side with respect to t. 
Substitute wu back by the function y, hence we obtain 


3 


y(t) 4 + y*(t) 4 > y(t) 4 S y(t) + ay y(t) =5+e 


This is an implicit form for the solution y of the problem. The solution is the root of a 
polynomial degree five for all possible values of the polynomial coefficients. But it has been 
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proven that there is no formula for the roots of a general polynomial degree bigger or equal 
five. We conclude that that there is no explicit expression for solutions y of Eq. (1.6.4). < 


We now give an example of the statement in (ii), that is, a differential equation which 
does not satisfy one of the hypothesis in Theorem 1.6.2. The function f has a discontinuity 
at a line in the (t,u) plane where the initial condition for the initial value problem is given. 
We then show that such initial value problem has two solutions instead of a unique solution. 


Example 1.6.7. Find every solution y of the initial value problem 
y()=y'(t), (0) =0. (1.6.5) 


Remark: The equation above is nonlinear, separable, and f(t,u) = u'/3 has derivative 


lL 
Ouf = 3 a8 


Since the function 0,,f is not continuous at u = 0, it does not satisfies the Lipschitz condition 
in Theorem 1.6.2 on any domain of the form S = [—a,a] x [—a,a] with a > 0. 


Solution: The solution to the initial value problem in Eq. (1.6.5) exists but it is not unique, 
since we now show that it has two solutions. The first solution is 


yi(t) = 0. 
The second solution can be computed as using the ideas from separable equations, that is, 


fol v@ae= farses 


Then, the substitution u = y(t), with du = y(t) dt, implies that 


preau= f dere. 


Integrate and substitute back the function y. The result is 


SOP atte + xt)=([Fe+e)]” 


The initial condition above implies 


0 = y(0) = (Ze) => co=0, 


so the second solution is: 


J 


Finally, an example of the statement in (iii). In this example we have an equation with 
solutions defined in a domain that depends on the initial data. 
Example 1.6.8. Find the solution y to the initial value problem 
y(t)=y'(t), —-y(0) = wo. 


Solution: This is a nonlinear separable equation, so we can again apply the ideas in Sect. 1.3. 
We first find all solutions of the differential equation, 


y' (t) dt 1 1 
| Se = fate => “ae > 2 ae Ee 
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We now use the initial condition in the last expression above, 


1 1 
w=y)=-— = @=-—. 
Co Yo 
So, the solution of the initial value problem above is: 
1 
y(t) = 


(. -#) 


This solution diverges at t = 1/y), so the domain of the solution y is not the whole real line 
R. Instead, the domain is R — {y.}, so it depends on the values of the initial data y. <1 


In the next example we consider an equation of the form y’(t) = f(t, y(t)), where f does 
not satisfy the hypotheses in Theorem 1.6.2. 


Example 1.6.9. Consider the nonlinear initial 
value problem 


! = 1 U 
¥O= EDGES DOW — DWH +3)" | 


y(to) = Yo- (1.6.6) 


Find the regions on the plane where the hypotheses = - -- -- ~~ 
in Theorem 1.6.2 are not satisfied. 


(teu) ; 

,U — ? 
(t —1)((+ 1)(u — 2)(u +3) 

so f is not defined on the lines 


(1.6.7) 


t=1, t=-1, u= 2, u=-—3. 
See Fig. 6. For example, in the case that the initial 4 ’ 
data is t) = 0, yo = 1, then Theorem 1.6.2 implies 
that there exists a unique solution on any region R FIGURE 6. Red regions 
contained in the rectangle R = (—1,1) x (—3, 2). where f in Eq. (1.6.7) is 
If the initial data for the initial value problem in not defined. 
Eq. (1.6.6) is t = 0, yo = 2, then the hypotheses of 
Theorem 1.6.2 are not satisfied. <J 


1 
1 
1 
1 
1 
Solution: In this case the function f is given by: 
1 
i} 
1 
1 
1 


1.6.3. Direction Fields. Sometimes one needs to find information about solutions of 
a differential equation without having to actually solve the equation. One way to do this is 
with the direction fields. Consider a differential equation 


y(t) = f(t, y())- 
We interpret the the right-hand side above in a new way. 


(a) In the usual way, the graph of f is a surface in the tyz-space, where z = f(t, y), 

(b) In the new way, f(t, y) is the value of a slope of a segment at each point (t,y) on the 
ty-plane. 

(c) That slope is the value of y’(t), the derivative of a solution y at ft. 


The ideas above suggest the following definition. 
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Definition 1.6.3. A direction field for the differential equation y'(t) = f(t, y(t)) is the 
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New Way 


4- Values as Sopes 
on the Zj-Pauwe 


FIGURE 7. The function f as a slope of a segment. 


graph on the ty-plane of the values f(t,y) as slopes of a small segments. 


We now show the direction fields of e few equations. 


Example 1.6.10. Find the direction field of the equation y’ = y, and sketch a few solutions 


to the differential equation for different initial conditions. 


Solution: Recall that the solutions are y(t) = yo e’. So is the direction field shown in Fig. 8. 


<q 


; Lobb) 
f ca cA 
x f Pi 
a i ee 
gO GOP! GOP EO GP OG EE GE PF GP PE POP GP PO GE 
BE BPE gh iPM PO gE gO BO GPO GP gO ge PO GB GU gel gg GaP PO PU AO gl seh eth gM 
Fe ad a ad an Ia a a a a a a a a a a a a a 
A A A A A A A A A A A A 


Sp ah a ge eh 
a a se ae ea a a ae a ee a oe ae 
i i i ee i i i a i i a ie i i i i ll i i Ma i Be a i 


Hoo LLL 
Oe Lb 
i eh ob aL 
ALLL LLL 
te ed 
Oe Lobb 
yl fot 
eld do fd 
Me od bd 
Cet foe 
LO LL LL SL 
ALL LL AL 
ee Le hd 
Ai Lo LLL 
Nall ae a ied 
CPL LE LL 
COLL LLL 
CO bo LL 
OL es 
MLL LLL 
Loe LL LL 
ALL LL AS 
wid lee 
kell LL oh 
Alo LLL 


FIGURE 8. Direction field for the equation y’ = y. 
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Example 1.6.11. Find the direction field of the equation y’ = sin(y), and sketch a few 


solutions to the differential equation for different initial conditions. 


The equation is separable so the solutions are 


Solution: 


for any yo € R. The graphs of these solutions are not simple to do. But the direction field 


is simpler to plot and can be seen in Fig. 9. 


<q 


TT EPEPATT ASN S| 
TPEPPP7TINIA\W\\A 1 
TEPTP7ITIVANA\AN 
CE PePF TITAN 
EEL ELLIINGCSS 
Teese e eee 
TEPTP7ITINAA\AN 
VIE ATI 
VITA 
TPPP77TITI\AAAAAA 
VLEET 
TEPT/7PTINAA\AN 
TIPE 
TPPP77TITIWAAAAA 
UT CPOPTIVN Ss 
VIE PATIITVAN 
TEPPP7ITITANA\AN 
TPEPPP7TITIAW\\A( 
TEEPE 
TEPTP7PTITANA\AN 
TEPPP7IITANWA\AN 
DEPTS 
TALL AAERNS' 
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n 
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FIGURE 9. Direction field for the equation y’ = sin(y). 


and sketch 


2 


) 


Example 1.6.12. Find the direction field of the equation y’ = 2 cos(t) cos(y 
a few solutions to the differential equation for different initial conditions. 


dq 


Solution: We do not need to compute the explicit solution of y’ = 2 cos(t) cos(y) to have a 


qualitative idea of its solutions. The direction field can be seen in Fig. 10. 
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FIGURE 10. Direction field for the equation y’ = 2 cos(t) cos(y). 
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1.6.4. Exercises. 


1.6.1.- Use the Picard iteration to find the 
first four elements, yo, y1, y2, and ys, 
of the sequence {yn}? of approximate 
solutions to the initial value problem 


y =6y+1, y(0)=0. 


1.6.2.- Use the Picard iteration to find the 
information required below about the 
sequence {yn }?=o of approximate solu- 
tions to the initial value problem 


y =3y+5, y(0)=1. 
(a) The first 4 elements in the sequence, 


Yo, Y1, Y2, and ¥3- 
(b) The general term cx(t) of the ap- 
proximation 


(c) Find the limit y(t) = limn—co yn (Et). 


1.6.3.- Find the domain where the solution 
of the initial value problems below is 
well-defined. 


—At 
(a) y =—, y(0)=yo > 0. 
(b) y’ = 2ty*, y(0) = yo > 0. 
1.6.4.- By looking at the equation coeffi- 


cients, find a domain where the solution 
of the initial value problem below exists, 


(a) (t?—4) y/+2In(t) y = 3¢, and initial 
condition y(1) = —2. 


(b) y’ = v= 3)’ and initial condition 
y(-1) = 2. 


1.6.5.- State where in the plane with points 
(t,y) the hypothesis of Theorem 1.6.2 


are not satisfied. 


y" 


2t — 3y" 


(a) y= 


CHAPTER 2 


Second Order Linear Equations 


Newton’s second law of motion, ma = f, is maybe one of the first differential equations 
written. This is a second order equation, since the acceleration is the second time derivative 
of the particle position function. Second order differential equations are more difficult to 
solve than first order equations. In § 2.1 we compare results on linear first and second order 
equations. While there is an explicit formula for all solutions to first order linear equations, 
not such formula exists for all solutions to second order linear equations. The most one 
can get is the result in Theorem 2.1.7. In § 2.2 we introduce the Reduction Order Method 
to find a new solution of a second order equation if we already know one solution of the 
equation. In § 2.3 we find explicit formulas for all solutions to linear second order equations 
that are both homogeneous and with constant coefficients. These formulas are generalized 
to nonhomogeneous equations in § 2.5. In § 2.6 we describe a few physical systems described 
by second order linear differential equations. 


77 


78 2. SECOND ORDER LINEAR EQUATIONS 


2.1. Variable Coefficients 


We studied first order linear equations in § 1.1-1.2, where we obtained a formula for all 
solutions to these equations. We could say that we know all that can be known about 
solutions to first order linear equations. However, this is not the case for solutions to second 
order linear equations, since we do not have a general formula for all solutions to these 
equations. 

In this section we present two main results, the first one is Theorem 2.1.2, which says 
that there are solutions to second order linear equations when the equation coefficients are 
continuous functions. Furthermore, these solutions have two free parameters that can be 
fixed by appropriate initial conditions. 

The second result is Theorem 2.1.7, which is the closest we can get to a formula for 
solutions to second order linear equations without sources—homogeneous equations. To 
know all solutions to these equations we only need to know two solutions that are not 
proportional to each other. The proof of Theorem 2.1.7 is based on Theorem 2.1.2 plus 
an algebraic calculation and properties of the Wronskian function, which are derived from 
Abel’s Theorem. 


2.1.1. Definitions and Examples. We start with a definition of second order linear 
differential equations. After a few examples we state the first of the main results, Theo- 
rem 2.1.2, about existence and uniqueness of solutions to an initial value problem in the 
case that the equation coefficients are continuous functions. 


Definition 2.1.1. A second order linear differential equation for the function y is 
y” +a,(t) y’ + ao(t) y = Dt), (2.1.1) 
where a1, do, b are given functions on the interval IC R. The Eq. (2.1.1) above: 


(a) is homogeneous iff the source b(t) = 0 for all t € R; 
(b) has constant coefficients iff a, and ay are constants; 
(c) has variable coefficients iff either a, or ao is not constant. 


Remark: The notion of an homogeneous equation presented here is different from the Euler 
homogeneous equations we studied in § 1.3. 


Example 2.1.1. 
(a) A second order, linear, homogeneous, constant coefficients equation is 
y’ +5y+6=0. 
(b) A second order, linear, nonhomogeneous, constant coefficients, equation is 
y” — 3y' + y = cos(3t). 
(c) A second order, linear, nonhomogeneous, variable coefficients equation is 
y” + 2ty' — In(t)y = e**. 


(d) Newton’s law of motion for a point particle of mass m moving in one space dimension 
under a force f is mass times acceleration equals force, 


my" (t) = f(t, y@), y(t). 


(e) Schrédinger equation in Quantum Mechanics, in one space dimension, stationary, is 


h2 
—a B+ Va) b= Ed, 
m 
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where w is the probability density of finding a particle of mass m at the position x 
having energy FE under a potential V, where f is Planck constant divided by 27. <J 


Example 2.1.2. Find the differential equation satisfied by the family of functions 


y(t) =aqe*+e er, 


where ¢,, C) are arbitrary constants. 


Solution: From the definition of y compute c,, 


At 8t 


C1 =ye ~—G@e- 


Now compute the derivative of function y 
y’ = 4c, e* — 4a, e~*, 
Replace c, from the first equation above into the expression for y’, 
yi =A(ye* — qe *)e# doe * = yl =4y4+(-4-Dae™, 
so we get an expression for c, in terms of y and y’, 


1 
jt > @= gity—¥’) ett 


At this point we can compute c, in terms of y and y’, although we do not need it for what 
follows. Anyway, 


y’ = Ay — 8c, e~4* 


1 1 
aaye™—s(4y—yete = a= s(dytyje™. 


We do not need c, because we can get a differential equation for y from the equation for c. 
Compute the derivative of that equation, 


1 1 
O=4=5l4y—y/el + s(4y'— "Je => A(4y— 9!) + (4y’ —y") = 0 
which gives us the following second order linear differential equation for y, 


y” —16y=0. 


Example 2.1.3. Find the differential equation satisfied by the family of functions 


y(t) == + eat, C1,€. ER. 


Solution: Compute y’ = a! +). Get one constant from y’ and put it in y, 
y rs y y 
oy Cy a 1 Cy 
so we get 
Cc c 2c 
(age Sa 


Compute the constant from the expression above, 
2c 
=y-—ty’ => 24 =ty—ty 


Since the left hand side is constant, 


0 = (2c,)’ =(ty-ty') =y+ty’ —2ty'-Py", 


/ 
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so we get that y must satisfy the differential equation 


By’ +ty' —y=0. 


dq 
Example 2.1.4. Find the differential equation satisfied by the family of functions 
y(z) =a r+e2”, 
where ¢,, C) are arbitrary constants. 
Solution: Compute the derivative of function y 
y (a) = C1 + 2¢2 xz, 
From here it is simple to get c1, 
cy =y —2coz. 
Use this expression for c; in the expression for y, 
/ 
y =(y' —2cga)e+e2*=2y—e2* > = - = 
Got 
To get the differential equation for y we do not need c,, but we compute it anyway, 
/ 
yy 2y 2y 
q=y—-A™—-4)¢=y-2y¥4+— => ag =-y't—. 
LG 2 x x 
The equation for y can be obtained computing a derivative in the expression for c, 
i / / i / 
O=ga4-F_FigF iY oF oF ig & gy’ any +2yH=0. 
c 2g? x x x? x 
dq 


2.1.2. Solutions to the Initial Value Problem. Here is the first of the two main 
results in this section. Second order linear differential equations have solutions in the case 
that the equation coefficients are continuous functions. Since the solution is unique when 
we specify two initial conditions, the general solution must have two arbitrary integration 
constants. 


Theorem 2.1.2 (IVP). If the functions a1, do, b are continuous on a closed interval IC R, 
the constant tp € I, and yo, y: € R are arbitrary constants, then there is a unique solution 
y, defined on I, of the initial value problem 


y" +ar(t)y’+a(t)y=(t), y(t.) = Yo, y(t.) = yr. (2.1.2) 


Remark: The fixed point argument used in the proof of Picard-Lindelof’s Theorem 1.6.2 
can be extended to prove Theorem 2.1.2. 


Example 2.1.5. Find the domain of the solution to the initial value problem 


A(t — 1) 


(t-—1)y" —3ty! 4 (t—3) 


CS t(t 1), y(2) =1, y' (2) =0. 


Solution: We first write the equation above in the form given in the Theorem above, 
3t 4 
1 / — t. 
"GD" "=a" 
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The equation coefficients are defined on the domain 
(—oo, 1) U (1,3) U (3, 00). 
So the solution may not be defined at t = 1 or t = 3. That is, the solution is defined in 
(—oo, 1) or (1,3) or (3, 00). 
Since the initial condition is at tp = 2 € (1,3), then the domain of the solution is 
D= (0,3). 


<q 


2.1.3. Properties of Homogeneous Equations. We simplify the problem with the 
hope to get deeper properties of its solutions. From now on in this section we focus on 
homogeneous equations only. We will get back to non-homogeneous equations in a later sec- 
tion. But before getting into homogeneous equations, we introduce a new notation to write 
differential equations. This is a shorter, more economical, notation. Given two functions 
G1, A, introduce the function LD acting on a function y, as follows, 


L(y) = y” + a,(t) y! + a(t) y. (2.1.3) 


The function L acts on the function y and the result is another function, given by Eq. (2.1.3). 


Example 2.1.6. Compute the operator L(y) = ty” + 2y’ — : y acting on y(t) = ¢°. 
Solution: Since y(t) = t?, then y’(t) = 3t? and y(t) = 6t, hence 

L(t?) = ¢ (6t) + 2(3#?) — = — Le jyode, 
The function L acts on the function y(t) = ¢? and the result is the function L(t?) = 4t?. < 


The function L above is called an operator, to emphasize that D is a function that acts 
on other functions, instead of acting on numbers, as the functions we are used to. The 
operator L above is also called a differential operator, since L(y) contains derivatives of y. 
These operators are useful to write differential equations in a compact notation, since 


y" +ay(t) y! + ao(t)y = f(t) 
can be written using the operator L(y) = y” + ai(t) y’ + ao(t) y as 
L(y) = f. 
An important type of operators are the linear operators. 


Definition 2.1.3. An operator L is a linear operator iff for every pair of functions y,, 
Yo and constants c,, Co holds 


Leys + C2Yo) = L(y) a CoL (Yo). (2.1.4) 


In this Section we work with linear operators, as the following result shows. 


Theorem 2.1.4 (Linear Operator). The operator L(y) = yy” +a,y' +aoy, where a,, a are 
continuous functions and y is a twice differentiable function, is a linear operator. 


82 2. SECOND ORDER LINEAR EQUATIONS 


Proof of Theorem 2.1.4: This is a straightforward calculation: 
L(crys + Coy) = (crys + Coe)!” + as (Crys + Coe)! + Go (Crys + oye). 
Recall that derivations is a linear operation and then reoorder terms in the following way, 
L(crys + Coy2) = (cry! + as cryy + G0 C191) + (Coys) + as Coy + Ao Co¥2). 
Introduce the definition of Z back on the right-hand side. We then conclude that 
Lexy, + Coy2) = 4 L (yr) + L(y). 


This establishes the Theorem. 
The linearity of an operator L translates into the superposition property of the solutions 
to the homogeneous equation L(y) = 0. 


Theorem 2.1.5 (Superposition). If L is a linear operator and y,, y2 are solutions of the 
homogeneous equations L(y,) = 0, L(y.) = 0, then for every constants c,, C, holds 


L(e Yr + C2 Yo) =0. 


Remark: This result is not true for nonhomogeneous equations. 


Proof of Theorem 2.1.5: Verify that the function y = c,y, + Coy. satisfies L(y) = 0 for 
every constants C,, C2, that is, 


L(y) = L(evys + cae) = 1 L(y) + L(Y) = 40+ 20 =0. 


This establishes the Theorem. 
We now introduce the notion of linearly dependent and linearly independent functions. 


Definition 2.1.6. Two functions y,, y. are called linearly dependent iff they are propor- 
tional. Otherwise, the functions are linearly independent. 


Remarks: 
(a) Two functions y;, y2 are proportional iff there is a constant c such that for all t holds 
yi(t) = cy2(t). 
(b) The function y, = 0 is proportional to every other function y2, since holds y, = 0 = 0%. 
The definitions of linearly dependent or independent functions found in the literature 
are equivalent to the definition given here, but they are worded in a slight different way. 


Often in the literature, two functions are called linearly dependent on the interval J iff there 
exist constants c,, C2, not both zero, such that for all t € I holds 


crys (t) + coy2(t) = 0. 
Two functions are called linearly independent on the interval J iff they are not linearly 
dependent, that is, the only constants c, and c, that for all t € I satisfy the equation 
C1Y1(t) + Coyo(t) = 0 


are the constants c, = c, = 0. This wording makes it simple to generalize these definitions 
to an arbitrary number of functions. 


Example 2.1.7. 
(a) Show that y,(t) = sin(t), yo(t) = 2sin(t) are linearly dependent. 
(b) Show that y,(¢) = sin(t), y(t) = tsin(¢) are linearly independent. 
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Solution: 
Part (a): This is trivial, since 2y,(t) — y.(t) = 0. 
Part (b): Find constants c;, c. such that for all t € R holds 
c, sin(t) + ct sin(t) = 0. 
Evaluating at ¢ = 7/2 and t = 37/2 we obtain 


3 
atta =0, at Fe 0 =O G=0: 


We conclude: The functions y,; and y2 are linearly independent. <J 


We now introduce the second main result in this section. If you know two linearly 
independent solutions to a second order linear homogeneous differential equation, then you 
actually know all possible solutions to that equation. Any other solution is just a linear 
combination of the previous two solutions. We repeat that the equation must be homoge- 
neous. This is the closer we can get to a general formula for solutions to second order linear 
homogeneous differential equations. 


Theorem 2.1.7 (General Solution). If y, and y2 are linearly independent solutions of the 
equation L(y) = 0 on an interval I C R, where L(y) = y’ +a.y' + ay, and a,, ay are 
continuous functions on I, then there are unique constants c,, Cy such that every solution y 
of the differential equation L(y) =0 on I can be written as a linear combination 


y(t) = c1 y(t) + C2 y(t). 


Before we prove Theorem 2.1.7, it is convenient to state the following the definitions, 
which come out naturally from this Theorem. 
Definition 2.1.8. 


(a) The functions y, and y. are fundamental solutions of the equation L(y) = 0 iff y:, 
Yo are linearly independent and 


L(y) = 9, L(y2) = 0. 


(b) The general solution of the homogeneous equation L(y) = 0 is a two-parameter family 
of functions Ygen given by 


Ygen(t) = e1 ya(t) + C2 ya(t), 


where the arbitrary constants c,, C, are the parameters of the family, and y,, Y2 are 
fundamental solutions of L(y) = 0. 


Example 2.1.8. Show that y, = e’ and y, = e~?* are fundamental solutions to the equation 


y’ +y' —2y =0. 


Solution: We first show that y, and y, are solutions to the differential equation, since 
L(y.) = yf + yf, — 2y, = e& + e® — 2e* = (1+1-—2)e* =0, 
L(yo) = yf + yh — 2y. = 467" — 277 — 2e-* = (4-2-2)e** = 0. 
It is not difficult to see that y, and yp are linearly independent. It is clear that they are not 


proportional to each other. A proof of that statement is the following: Find the constants 
c, and cy such that 


0=aytoawp=aaetoe” tEeR 3S O=c,e'—-2c,c°% 
¥y ¥: 


84 2. SECOND ORDER LINEAR EQUATIONS 


The second equation is the derivative of the first one. Take t = 0 in both equations, 
0O=a+q, 0=a4a-26 > GQ=Q=0. 


We conclude that y, and y, are fundamental solutions to the differential equation above.< 


Remark: The fundamental solutions to the equation above are not unique. For example, 
show that another set of fundamental solutions to the equation above is given by, 


2 1 1 
y(t) = 30 + ne Yo(t) = 3 (e —e**), 

To prove Theorem 2.1.7 we need to introduce the Wronskian function and to verify 
some of its properties. The Wronskian function is studied in the following Subsection and 
Abel’s Theorem is proved. Once that is done we can say that the proof of Theorem 2.1.7 is 
complete. 

Proof of Theorem 2.1.7: We need to show that, given any fundamental solution pair, 
Y1, Y2, any other solution y to the homogeneous equation L(y) = 0 must be a unique linear 
combination of the fundamental solutions, 


y(t) = cr y(t) + c y2(t), (2.1.5) 


for appropriately chosen constants c,, C2. 

First, the superposition property implies that the function y above is solution of the 
homogeneous equation L(y) = 0 for every pair of constants ¢,, C2. 

Second, given a function y, if there exist constants c,, c. such that Eq. (2.1.5) holds, 
then these constants are unique. The reason is that functions y,, y2 are linearly independent. 
This can be seen from the following argument. If there are another constants ¢,, C. so that 


y(t) = er yi(t) + & w(t), 
then subtract the expression above from Eq. (2.1.5), 
0= (4 —G) yt (G& — &) y% > Cc, —C, = 0, Co —C = 0, 


where we used that yi, yo are linearly independent. This second part of the proof can be 
obtained from the part three below, but I think it is better to highlight it here. 

So we only need to show that the expression in Eq. (2.1.5) contains all solutions. We 
need to show that we are not missing any other solution. In this third part of the argument 
enters Theorem 2.1.2. This Theorem says that, in the case of homogeneous equations, the 
initial value problem 

L(y) = 0, y(to) = di, y' (to) = d,, 
always has a unique solution. That means, a good parametrization of all solutions to the 
differential equation L(y) = 0 is given by the two constants, d,, d, in the initial condition. 
To finish the proof of Theorem 2.1.7 we need to show that the constants c, and c, are also 
good to parametrize all solutions to the equation L(y) = 0. One way to show this, is to 
find an invertible map from the constants d,, d,, which we know parametrize all solutions, 
to the constants c,, C2. The map itself is simple to find, 


dy = C1 yi (to) + C2 Yo(to) 
dy = C1 (to) + C2 Yo (to). 


We now need to show that this map is invertible. From linear algebra we know that this 
map acting on ¢,, Cp is invertible iff the determinant of the coefficient matrix is nonzero, 


Yyi(to) Ya(to) 
Yi(to) Yo(to) 


= 91 (to) Y3(to) = Y; (to) Yo(to) # 0. 


2.1. VARIABLE COEFFICIENTS 85 


This leads us to investigate the function 
Wio(t) = w(t) v(t) — yr (t)ye(t) 
This function is called the Wronskian of the two functions y;, y.. At the end of this section 


we prove Theorem 2.1.13, which says the following: If y,, y. are fundamental solutions of 
L(y) =0 onI CR, then W,,(t) 4 0 on J. This statement establishes the Theorem. 


2.1.4. The Wronskian Function. We now introduce a function that provides im- 
portant information about the linear dependency of two functions y,, y2. This function, W, 
is called the Wronskian to honor the polish scientist Josef Wronski, who first introduced 
this function in 1821 while studying a different problem. 


Definition 2.1.9. The Wronskian of the differentiable functions y,, Y2 is the function 
Wia(t) = ys (t)ya(t) — yi(t)yo(t). 


Remark: Introducing the matrix valued function A(t) = ba y : the Wronskian can 
1 2 
be written using the determinant of that 2 x 2 matrix, W1.(t) = det(A(t)). An alternative 
notation is: Wy, =|" |. 
Yr Ye 


Example 2.1.9. Find the Wronskian of the functions: 
(a) y,(t) = sin(t) and y,(t) = 2sin(t). (Id) 

(b) y(t) = sin(t) and y(t) = tsin(t). (li) 

Solution: 

Part (a): By the definition of the Wronskian: 


y1(t) a sin(t) 2sin(t) 
yi(t) yZ(t)|  jeos(t) 2 cos(t) 


We conclude that W,,(t) = 0. Notice that y, and y, are linearly dependent. 
Part (b): Again, by the definition of the Wronskian: 


W,.(t) = 


| = sin(t)2 cos(t) — cos(t)2 sin(t) 


_ |sin(t) t sin(¢) _ : _ : 
Wt) = eet “Geate) 4 coat sin(t) [sin(t) + t cos(t)] — cos(t)tsin(t). 
We conclude that W,(t) = sin?(t). Notice that y,; and y, are linearly independent. <J 


It is simple to prove the following relation between the Wronskian of two functions and 
the linear dependency of these two functions. 
Theorem 2.1.10 (Wronskian I). If y:, y2 are linearly dependent on ICR, then 
Wi=0 on ZT. 


Proof of Theorem 2.1.10: Since the functions y,, y. are linearly dependent, there exists 
a nonzero constant c such that y:; = cy; hence holds, 

We = Yr Yo — Yi Yo = (CY2) Yo — (Ye)! Yo = 0. 
This establishes the Theorem. 


Remark: The converse statement to Theorem 2.1.10 is false. If Wi.(t) = 0 for all t € J, 
that does not imply that y,; and y2 are linearly dependent. 
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Example 2.1.10. Show that the functions 
yi(t)=#?, and y(t)=|t|t, for teR 


are linearly independent and have Wronskian W;,, = 0. 


Solution: 

First, these functions are linearly independent, since y,(¢) = —y,(t) for t < 0, but 
yi(t) = yo(t) for t > 0. So there is not c such that y,(¢) = cy.(t) for all t € R. 

Second, their Wronskian vanishes on R. This is simple to see, since y,(t) = —y,(t) for 
t <0, then W,, = 0 for t < 0. Since y,(t) = y(t) for t > 0, then W,, = 0 for t > 0. Finally, 
it is not difficult to see that W,.(t = 0) = 0. <J 


Remark: Often in the literature one finds the negative of Theorem 2.1.10, which is equiv- 
alent to Theorem 2.1.10, and we summarize ibn the followng Corollary. 


Corollary 2.1.11 (Wronskian I). If the Wronskian W12(to) #0 at a point to € I, then the 
functions Y1, Y2 defined on I are linearly independent. 


The results mentioned above provide different properties of the Wronskian of two func- 
tions. But none of these results is what we need to finish the proof of Theorem 2.1.7. In 
order to finish that proof we need one more result, Abel’s Theorem. 


2.1.5. Abel’s Theorem. We now show that the Wronskian of two solutions of a 
differential equation satisfies a differential equation of its own. This result is known as 
Abel’s Theorem. 

Theorem 2.1.12 (Abel). If y1, yo are twice continuously differentiable solutions of 
y” +.a,(t) y! + a(t) y = 0, (2.1.6) 
where a1, Ao are continuous on I CR, then the Wronskian W,, satisfies 
Wi, + ai(t) Win = 0. 
Therefore, for any to € I, the Wronskian W, is given by the expression 
W12(t) = Wia(to) e~ 42, 


where A,(t) = / a,(s) ds. 


to 


Proof of Theorem 2.1.12: We start computing the derivative of the Wronskian function, 


/ 
Wi = (Yi ¥2 — Yi Y2) =I. V2 — Yt Yo. 
Recall that both y, and y, are solutions to Eq. (2.1.6), meaning, 


NW ! ye ! 
Yy = —41 YU, — Go YW; Yo = —41 Yo — Ao Yr: 


Replace these expressions in the formula for W/, above, 


Wi = ( ay Y Qo Yo) ( ay ui Qo 7) Yo => Wi = a1 (uy: Ys _ ui 7) 
So we obtain the equation 
Wi, +ai(t) Win = 0. 
This equation for W,, is a first order linear equation; its solution can be found using the 


method of integrating factors, given in Section 1.1, which results is the expression in the 
Theorem 2.1.12. This establishes the Theorem. 
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We now show one application of Abel’s Theorem. 


Example 2.1.11. Find the Wronskian of two solutions of the equation 
Py” —t(t+2)y' + (t+2)y=0, i>, 


Solution: Notice that we do not known the explicit expression for the solutions. Neverthe- 
less, Theorem 2.1.12 says that we can compute their Wronskian. First, we have to rewrite 
the differential equation in the form given in that Theorem, namely, 


2 2 1 
i _ = 1) / (5 =) = 0. 
y (7 wae Tet ae 
Then, Theorem 2.1.12 says that the Wronskian satisfies the differential equation 
2 
w(t) — (= es 1) W(t) = 0. 


This is a first order, linear equation for W,2, so its solution can be computed using the 
method of integrating factors. That is, first compute the integral 


-f 2+1)a= 2In( =) (t — to) 


= in(%) —(t—t). 


Then, the integrating factor uu is given by 
te (tt 
u(t) = #2 € ( °), 
which satisfies the condition ju(t)) = 1. So the solution, W,, is given by 
t: 
(u(t)Wialt)) =0 = p(t) Waalt) — (te) Wialto) = 0 
so, the solution is 
t? 
Wa2(t) = Waalto) gr), 
0 
If we call the constant c = W12(to)/[t2e°], then the Wronskian has the simpler form 
W,2(t) = ct?et. 
dq 


We now state and prove the statement we need to complete the proof of Theorem 2.1.7. 


Theorem 2.1.13 (Wronskian II). If y,, y. are fundamental solutions of L(y) =0 onI CR, 
then W,.(t) #0 on I. 


Remark: Instead of proving the Theorem above, we prove an equivalent statement—the 
negative statement. 


Corollary 2.1.14 (Wronskian II). Jf y1, y2 are solutions of L(y) =0 on I CR and there 
is a point t, € I such that Wi2(t,) = 0, then y1, Yo are linearly dependent on I. 


Proof of Corollary 2.1.14: We know that y,, y, are solutions of L(y) = 0. Then, Abel’s 
Theorem says that their Wronskian W,, is given by 


W,2(t) = Wi2(to) ew), 
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for any t) € I. Chossing the point ty to be t,, the point where by hypothesis W,,(t,) = 0, 
we get that 
Wr (t) => 0 for all te de: 


Knowing that the Wronskian vanishes identically on J, we can write 


Ys Yo — Vs Yo = 0, 
on I. If either y, or y2 is the function zero, then the set is linearly dependent. So we 
can assume that both are not identically zero. Let’s assume there exists t, € J such that 
yi(t,) 4 0. By continuity, y, is nonzero in an open neighborhood J, C I of ¢,. So in that 
neighborhood we can divide the equation above by y?, 


rat ! 
ia 2 0 => (“) =0 = 2B=c of, 

Yt UN 
where c € R is an arbitrary constant. So we conclude that y; is proportional to y2 on the 
open set J,. That means that the function y(t) = y(t) — cy,(t), satisfies 

L(y) =0, — y(ts) =0, y(t) = 0. 

Therefore, the existence and uniqueness Theorem 2.1.2 says that y(t) = 0 for allt € I. This 
finally shows that y; and yp are linearly dependent. This establishes the Theorem. 
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2.1.6. Exercises. 


2.1.1.- Find the constants c and k such that 
the function y(t) = ct* is solution of 


PytePyt4ty=1. 


2.1.2.- Let y(t) =cait+ce t? be the general 
solution of a second order linear differ- 
ential equation L(y) = 0. By eliminat- 
ing the constants c; and cz, find the dif- 
ferential equation satisfied by y. 


2.1.3.- (a) Verify that y(t) = ¢ and 
yo(t) = 1/t are solutions to the dif- 
ferential equation 


Py” — 2y=0, t>0. 


(b) Show that y(t) = at? + : is so- 
lution of the same equation for all 
constants a,b € R. 


2.1.4.- Find the longest interval where the 
solution y of the initial value problems 
below is defined. (Do not try to solve 
the differential equations.) 


(a) ty" + 6y = 2t, y(1) = 2, y'(1) 


(b) (¢ — 6)y’ + 3ty’ — y = 1, 413) 
—1, y/(3) =2. 


3. 


2.1.5.- If the graph of y, solution to a sec- 
ond order linear differential equation 
L(y(t)) = 0 on the interval [a, 6], is tan- 
gent to the t-axis at any point to € [a,b], 
then find the solution y explicitly. 


2.1.6.- Can the function y(t) = sin(t?) be 
solution on an open interval containing 
t = 0 of a differential equation 
y+ a(t) y’ + b(t)y = 0, 


with continuous coefficients a and b? 


Explain your answer. 


2.1.7.- Compute the Wronskian of the fol- 
lowing functions: 


(a) ft) = fa g 
(b) f(z) = 2, g(a) 
(c) f(8) = cos” (6), 


og 
a0 ) a 1+ cos(26). 


2.1.8.- Verify whether the functions y1, y2 


below are a fundamental set for the dif- 
ferential equations given below: 


(a) yi(t) = cos(2t), yo(t) = sin(2¢), 


(c) yi(x) =, yo(t) = xe”, 
ay” —Qx(x +2)y' + (a@+2)y =0. 


2.1.9.- If the Wronskian of any two solu- 


tions of the differential equation 


y" +plt)y' +at)y =0 
is constant, what does this imply about 
the coefficients p and q? 


2.1.10.- * Suppose yi is solution of the IVP 


yi tary tay = 0, es oe 
yi(0) =5, 

and y2 is solution of the IVP 
yi tary, +aoy =0, oe a 
yi(0) =1 


that is, same differential equation and 
same initial condition for the function, 
but different initial conditions for the 
derivatives. Then show that the func- 
tions y; and y2 must be proportional to 
each other, 


yi(t) = cya(t) 
and find the proportionality factor c. 


Hint 1: Theorem 2.1.2 says that the 
initial value problem 


y(0) =0, 
y'(0) =0, 
has a unique solution and it is y(t) = 0 
for all t. 
Hint 2: Find what is the initial value 
problem for the function 

ye(t) = y(t) — cye(t), 


and fine tune c to use hint 1. 


y +a1y' +acy=0, 
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2.2. Reduction of Order Methods 


Sometimes a solution to a second order differential equation can be obtained solving two 
first order equations, one after the other. When that happens we say we have reduced the 
order of the equation. We use the ideas in Chapter 1 to solve each first order equation. 

In this section we focus on three types of differential equations where such reduction of 
order happens. The first two cases are usually called special second order equations and the 
third case is called the conservation of the energy. 

We end this section with a method that provides a second solution to a second order 
equation if you already know one solution. The second solution can be chosen not propor- 
tional to the first one. This idea is called the reduction order method—although all four 
ideas we study in this section do reduce the order of the original equation. 


2.2.1. Special Second Order Equations. A second order differential equation is 
called special when either the function, or its first derivative, or the independent variable 
does not appear explicitly in the equation. In these cases the second order equation can 
be transformed into a first order equation for a new function. The transformation to get 
the new function is different in each case. Then, one solves the first order equation and 
transforms back solving another first order equation to get the original function. We start 
with a few definitions. 


Definition 2.2.1. A second order equation in the unknown function y is an equation 


y =fG,y,¥)- 


where the function f : R°? > R is given. The equation is linear iff function f is linear in 
both arguments y and y’. The second order differential equation above is special iff one of 
the following conditions hold: 


(a) y" = f(t,y.y'), the function y does not appear explicitly in the equation; 
(ob) y’ = f(%b,y’), the variable t does not appear explicitly in the equation. 
(c) y" = If YX), the variable t, the function y’ do not appear explicitly in the equation. 


It is simpler to solve special second order equations when the function y is missing, 
case (a), than when the variable t is missing, case (b), as it can be seen by comparing 
Theorems 2.2.2 and 2.2.3. The case (c) is well known in physics, since it applies to Newton’s 
second law of motion in the case that the force on a particle depends only on the position 
of the particle. In such a case one can show that the energy of the particle is conserved. 

Let us start with case (a). 


Theorem 2.2.2 (Function y Missing). If a second order differential equation has the form 
y” = f(t,y’), thenv=y’ satisfies the first order equation v' = f(t,v). 


The proof is trivial, so we go directly to an example. 


Example 2.2.1. Find the y solution of the second order nonlinear equation y” = —2t (y’)? 
with initial conditions y(0) = 2, y’(0) = —1. 


Solution: Introduce v = y’. Then v’ = y”, and 


1 
vl = —2tv? Be —2t == +6 
v 
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1 
So, = = t? — ¢, that is, y’ = p . The initial condition implies 
y —c 


-1=y'(0) =- 


1 
c 


dt 
Then, y = +c. We integrate using the method of partial fractions, 
y P-1 & & 


Wess 1 «a 2% b 
@-1  (¢-1)(4+1) @-1) ° @+1) 
1 1 
Hence, 1 = a(t + 1) + b(t — 1). Evaluating at t = 1 and t = —1 we get a= 5 b= =a So 
1 | 1 1 
#—1° 2l(t-1) (+1) 
Therefore, the integral is simple to do, 
1 1 
y= 5(nlt-1|-Inft +1) +e. 2=y(0) =5(0-0) +e 
1 
We conclude y = 5 (In |t 1| —In|t + 1|) +2. <J 


The case (b) is way more complicated to solve. 
Theorem 2.2.3 (Variable t Missing). If the initial value problem 
Y=fyy), yO=%, y(O)=H, 


has an invertible solution y, then the function 


w(y) = v(t(y)), 


where v(t) = y'(t), and t(y) is the inverse of y(t), satisfies the initial value problem 


w = f(y, aw) w(Yo) = ty 
w 
dw 


where we denoted w = — 
dy 


Remark: The proof is based on the chain rule for the derivative of functions. 


Proof of Theorem 2.2.3: The differential equation is y” = f(y,y’). Denoting v(t) = y’(t) 
v' = f(y,v) 


It is not clear how to solve this equation, since the function y still appears in the equation. 
On a domain where y is invertible we can do the following. Denote t(y) the inverse values 
of y(t), and introduce w(y) = v(t(y)). The chain rule implies 


_dw) _ my dt) _v@®) _v@®y) _ fy.) 
~~ dyly — dt ley) dyluy) —-y’(H)tw v(t) Ie v(t) 


f(y, wy) 


ty) w(y) 


wy) 


d d 
where w(y) = a and v'(t) = oe Therefore, we have obtained the equation for w, namely 
y 


f(y, w) 


W 


w= 
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Finally we need to find the initial condition fro w. Recall that 


y(t = 0) = Yo = t(y = yo) = 0, 

y(t=0 = S v(t = 0) =. 
Therefore, 

wy = yo) = v(tly = yo)) = v(t =0) =H wW(Yo) = Ys- 


This establishes the Theorem. 


Example 2.2.2. Find a solution y to the second order equation y” = 2yy’. 


Solution: The variable t does not appear in the equation. So we start introduciong the 
function u(t) = y/(t). The equation is now given by v’(t) = 2y(t) v(t). We look for invertible 
solutions y, then introduce the function w(y) = v(t(y)). This function satisfies 


/ / 


ly) = = (2 *) exe) aes 
Yay dt dy/\ty) oy! Itty) Ue) 
Using the differential equation, 

. 2yv dw 

w(y) = —— —— = dy wy) =y? +e. 

v_ ley) dy 
Since v(t) = w(y(t)), we get v(t) = y?(t) +c. This is a separable equation, 
y(t) 
y?(t) +e 


Since we only need to find a solution of the equation, and the integral depends on whether 
c>0,c=0,c <0, we choose (for no special reason) only one case, c = 1. 


d 
/ oe je +c => arctan(y) =t+cqy(t) = tan(t +o). 


1-4? 
Again, for no reason, we choose cy = 0, and we conclude that one possible solution to our 
problem is y(t) = tan(t). <J 


Example 2.2.3. Find the solution y to the initial value problem 
yy +3(y')? =0, (0) = 1, (0) = 6. 


Solution: We start rewriting the equation in the standard form 


The variable t does not appear explicitly in the equation, so we introduce the function 
u(t) = y'(t). The differential equation now has the form v’(t) = —3v?(t)/y(t). We look for 
invertible solutions y, and then we introduce the function w(y) = u(t(y)). Because of the 
chain rule for derivatives, this function satisfies 


. dw e =) vu vu! . vu’ (t(y)) 
WwW = — = —_— — =S— = — => WwW = 
) dy ) = ae dy/\ty) —-y! ley) ou le) ) w(y) 
Using the differential equation on the factor v’, we get 
—3y2 1 —3yw?2 oa 
iN 3u°(t(y)) _ =38w Se ge sw 


y w yw y 
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This is a separable equation for function w. The problem for w also has initial conditions, 
which can be obtained from the initial conditions from y. Recalling the definition of inverse 
function, 


y(t =0)=1 t(y=1)=0. 
Therefore, 


wy = 1) = v(t(y = 1)) = v(0) = (0) =6, 
where in the last step above we use the initial condition y’(0) = 6. Summarizing, the initial 
value problem for w is 


The equation for w is separable, so the method from § 1.3 implies that 
In(w) = -3ln(y) +o =n(y3)+o => wly)=ay?, a =e. 
The initial condition fixes the constant c,, since 
6=wl=a > wly)=6y 
We now transform from w back to v as follows, 
v(t) = w(y(t)) =6y-9(t) => y(t) = by *(2). 


This is now a first order separable equation for y. Again the method from § 1.3 imply that 


y! 
y? y 6 ms 6t + Co 
The initial condition for y fixes the constant cz, since 
1 y* 1 
1=y(0) => Z O+ce = mi 6t + A 


So we conclude that the solution y to the initial value problem is 
y(t) = (24¢ + 1)*. 
4] 


2.2.2. Conservation of the Energy. We now study case (c) in Def. 2.2.1—second 
order differential equations such that both the variable t and the function y’ do not appear 
explicitly in the equation. This case is important in Newtonian mechanics. For that reason 
we slightly change notation we use to write the differential equation. Instead of writing the 
equation as y” = f(y), as in Def. 2.2.1, we write it as 


my” = f(y), 


where m is aconstant. This notation matches the notation of Newton’s second law of motion 
for a particle of mass m, with position function y as function of time t, acting under a force 
f that depends only on the particle position y. 

It turns out that solutions to the differential equation above have a particular property: 
There is a function of y’ and y, called the energy of the system, that remains conserved 
during the motion. We summarize this result in the statement below. 


Theorem 2.2.4 (Conservation of the Energy). Consider a particle with positive mass m 
and position y, function of time t, which is a solution of Newton’s second law of motion 


my” = f(y), 
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with initial conditions y(to) = yo and y'(to) = vo, where f(y) is the force acting on the 
particle at the position y. Then, the position function y satisfies 


1 
pine” + V(y) = Ev 


where Ey = 4mvz + V(yo) is fixed by the initial conditions, v(t) = y'(t) is the particle 
velocity, and V is the potential of the force f—the negative of the primitive of f, that is 


Remark: The term T(v) = $mv” is the kinetic energy of the particle. The term V(y) is 


the potential energy. The Theorem above says that the total mechanical energy 
E=T(v)+V(y) 
remains constant during the motion of the particle. 


Proof of Theorem 2.2.4: We write the differential equation using the potential V, 
Multiply the equation above by y’, 


Use the chain rule on both sides of the equation above, 

dl jee d 

5 (sm) =-ZV@@). 
Introduce the velocity v = y’, and rewrite the equation above as 


“(5m wy + V(y)) = 0. 


This means that the quantity 
1 
Ely.v) = 5m? + Vy), 


called the mechanical energy of the system, remains constant during the motion. Therefore, 
it must match its value at the initial time t), which we called Ey in the Theorem. So we 
arrive to the equation 


1 
E(y,v) = smu +V(y) = Eo. 


This establishes the Theorem. 


Example 2.2.4. Find the potential energy and write the energy conservation for the fol- 
lowing systems: 


(i) A particle attached to a spring with constant k, moving in one space dimension. 

(it) A particle moving vertically close to the Earth surface, under Earth’s constant gravita- 
tional acceleration. In this case the force on the particle having mass m is f(y) = mg, 
where g = 9.81 m/s?. 

(itt) A particle moving along the direction vertical to the surface of a spherical planet with 
mass M and radius R. 
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Solution: 
Case (i). The force on a particle of mass m attached to a spring with spring constant 
k > 0, when displaced an amount y from the equilibrium position y = 0 is f(y) = —ky. 


Therefore, Newton’s second law of motion says 
my” =—ky. 


The potential in this case is V(y) = $ky*, since —dV/dy = —ky = f. If we introduce the 
particle velocity v = y’, then the total mechanical energy is 


1 1 
E(y,v) = zm + 5 ky” 
The conservation of the energy says that 

Tt ope 

= hy = Ey, 

aie + 5 Yy 0; 
where £) is the energy at the initial time. 
Case (ii). Newton’s equation of motion says: my” = mg. If we multiply Newton’s 
equation by y’, we get 

1 
my y"=mgy => a(5m'? +mgy) =0 

If we introduce the the gravitational energy 


1 
E(y,v) = yw + mgy, 


dE 
where v = y’, then Newton’s law says a 0, so the total gravitational energy is constant, 


1 
yw +mgy = E(0). 


Case (iii). Consider a particle of mass m moving on a line which is perpendicular to the 
surface of a spherical planet of mass M and radius R. The force on such a particle when is 
at a distance y from the surface of the planet is, according to Newton’s gravitational law, 


GMm 


f(y) = ~(R+y)2’ 


where G = 6.67 x 107" ake is Newton’s gravitational constant. The potential is 


GMm 
4 = ’ 
w) (R+y) 
since —dV/dy = f(y). The energy for this system is 
1 GMm 
E(y,v) = =mv? — ——— 
CD ae Ray) 
where we introduced the particle velocity v = y’. The conservation of the energy says that 
1 , GMm 
=mvu* — ——~ = h, 
2 (E+y) ° 


where E) is the energy at the initial time. <J 
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Example 2.2.5. Find the maximum height of a ball of mass m = 0.1 Kg that is shot 
vertically by a spring with spring constant k = 400 Kg/s? and compressed 0.1 m. Use 
g = 10 m/s?. 


Solution: This is a difficult problem to solve if one tries to find the position function y and 
evaluate it at the time when its speed vanishes—maximum altitude. One has to solve two 
differential equations for y, one with source f; = —ky—mg and other with source fz = —mg, 
and the solutions must be glued together. The first source describes the particle when is 
pushed by the spring under the Earth’s gravitational force. The second source describes 
the particle when only the Earth’s gravitational force is acting on the particle. Also, the 
moment when the ball leaves the spring is hard to describe accurately. 
A simpler method is to use the conservation of the mechanical and gravitational energy. 

The energy for this particle is 

1 1 

E(t) = 5 mv + 5 ky? + mgy. 

This energy must be constant along the movement. In particular, the energy at the initial 
time t = 0 must be the same as the energy at the time of the maximum height, ty,, 

1 1 1 

E(t=0)=E(tu) = 2 muy a 3 kys + MGYo = 2 mn + mgyu- 

But at the initial time we have vu, = 0, and y = —0.1, (the negative sign is because the 
spring is compressed) and at the maximum time we also have vy, = 0, hence 


i. g ks 
a kyo +mMgyo=Mgym > Yu =Y+ts—Y- 
2 2mg 


We conclude that yj, = 1.9 m. <l 


Example 2.2.6. Find the escape velocity from Earth—the initial velocity of a projec- 
tile moving vertically upwards starting from the Earth surface such that it escapes Earth 
gravitational attraction. Recall that the acceleration of gravity at the surface of Earth is 
g = GM/R? = 9.81m/s?, and that the radius of Earth is R = 6378 Km. Here M denotes 
the mass of the Earth, and G is Newton’s gravitational constant. 


Solution: The projectile moves in the vertical direction, so the movement is along one space 
dimension. Let y be the position of the projectile, with y = 0 at the surface of the Earth. 
Newton’s equation in this case is 


n _ GMm 
eS Ry 
We start rewriting the force using the constant g instead of G, 
GMm GM mR? gmR? 
(Rey? (R+yP? (Ry)? 
So the equation of motion for the projectile is 
ii gmR? 
my = —-——; 


(R+y)? 
The projectile mass m can be canceled from the equation above (we do it later) so the result 
will be independent of the projectile mass. Now we introduce the gravitational potential 
gmR? 


Viy) = Gaal 
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We know that the motion of this particle satisfies the conservation of the energy 
1 4  gmR? 


mu — 
2 (R+y) 
where v = y’. The initial energy is simple to compute, y(0) = 0 and v(0) = v9, so we get 


= Eo, 


1 4 gmR? 1 4 
mou (t = =mvu mR. 
gee) 
We now cancel the projectile mass from the equation, and we rewrite the equation as 
2gR? 
2 2 
u(t) = uf — 2gR + ——_.. 
° (R+ y(t) 


Now we choose the initial velocity vp to be the escape velocity v.. The latter is the smallest 
initial velocity such that v(t) is defined for all y including y > oo. Since 
2gR? 


v?(t) > 0 and R+y) 


> 0, 


this means that the escape velocity must satisfy 
v2 —22gR>0. 
Since the escape velocity is the smallest velocity satisfying the condition above, that means 
ve=V2gR => v. =11.2Km/s. 
dq 


Example 2.2.7. Find the time t,, for a rocket to reach the Moon, if it is launched at 
the escape velocity. Use that the distance from the surface of the Earth to the Moon is 
d = 405, 696 Km. 


Solution: From Example 2.2.6 we know that the position function y of the rocket satisfies 
the differential equation 
2gR? 

(R+y(t))’ 
where R is the Earth radius, g the gravitational acceleration at the Earth surface, v = y/’, 
and vp is the initial velocity. Since the rocket initial velocity is the Earth escape velocity, 
Uy = Ve = V2gR, the differential equation for y is 

(y’)? = 2gR? = y’ _ V2gR , 

(R+y) VR+y 

where we chose the positive square root because, in our coordinate system, the rocket leaving 


Earth means v > 0. Now, the last equation above is a separable differential equation for y, 
so we can integrate it, 


(R+y?@y=/29R = “(R+y)'? = ,/2g Ri +c, 


where c is a constant, which can be determined by the initial condition y(t = 0) = 0, since 
at the initial time the projectile is on the surface of the Earth, the origin of out coordinate 
system. With this initial condition we 


v(t) = ve —2gR+ 


2 
c= GR? a 5 24.4) 3? = /2g Rt+ ae (2.91) 
From the equation above we can compute an explicit form of ce solution function y, 


= G 2g Rt+ Rea)" =f; (2.2.2) 
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To find the time to reach the Moon we need to evaluate Eq. (2.2.1) for y = d and get ty, 


2 2 2. al 
—Redpe = /ehins -. -S ty=]—— = (R40 =), 
3( +d) ght + 3 M>=3 Jig R (( + d) ) 
The formula above gives t),; = 51.5 hours. <J 


2.2.3. The Reduction of Order Method. If we know one solution to a second 
order, linear, homogeneous, differential equation, then one can find a second solution to 
that equation. And this second solution can be chosen to be not proportional to the known 
solution. One obtains the second solution transforming the original problem into solving 
two first order differential equations. 


Theorem 2.2.5 (Reduction of Order). If a nonzero function y, is solution to 


y” +ay(t) y’ + a(t) y = 0. (223) 
where a1, Ao are given functions, then a second solution not proportional to y, is 
=n) [a (2.2.4) 
Yo(t) = yrlt : wares ie 
: : yi (t) 


where A(t) = f a,(t) dt. 


Remark: In the first part of the proof we write y,(t) = v(t) y,(t) and show that y, is 
solution of Eq. (2.2.3) iff the function v is solution of 


ul + (ote + a,(t)) of =O (2.2.5) 


In the second part we solve the equation for v. This is a first order equation for for w = v’, 
since v itself does not appear in the equation, hence the name reduction of order method. 
The equation for w is linear and first order, so we can solve it using the integrating factor 
method. One more integration gives v, which is the factor multiplying y, in Eq. (2.2.4). 


Remark: The functions v and w in this subsection have no relation with the functions v 
and w from the previous subsection. 


Proof of Theorem 2.2.5: We write y. = vy, and we put this function into the differential 
equation in 2.2.3, which give us an equation for v. To start, compute yj, and yf, 


W=VYwtVY,  Y=v" wt’ y +vyy. 
Introduce these equations into the differential equation, 
O= (vy + Quy, + vy) + as (vy + UY) + Gov Ys 
=y,v" + (2Qy, tary) vu’ + (yl! + a1 yl + ao 1) v- 
The function y; is solution to the differential original differential equation, 
Yi + ary; + G04. = 0, 


then, the equation for v is given by 
yiv' + (2Qyit+ary)v.=0. > vl + (24 + a1) — 


This is Eq. (2.2.5). The function v does not appear explicitly in this equation, so denoting 
w =v’ we obtain : 
w+ (24 +a;) w=0. 
"1 
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This is is a first order linear equation for w, so we solve it using the integrating factor 
method, with integrating factor 


ui =POeaO,. where Ath = / Gt 
Therefore, the differential equation for w can be rewritten as a total t-derivative as 
e7~A1(#) 
= wo 
yi (t) 
Since vu’ = w, we integrate one more time with respect to ¢ to obtain 


| i e~A1(t) 1 
v(t) = w —.— dt+ vo. 
*  ye(t) : 


We are looking for just one function v, so we choose the integration constants wo = 1 and 
vo = 0. We then obtain 


= fa eae |= a 
= fea = we)=nl) [Poa 
yi (t) ; yi (t) 
For the furthermore part, we now need to show that the functions y, and y. = vy; are 
linearly independent. We start computing their Wronskian, 


(y2 e4* w)’ =0 > y2 et w = wo => wi(t) 


YW UY 

yy (v'ys + vy;) 
Recall that above in this proof we have computed v’ = w, and the result was w = wy e~41/y?. 
So we get v'y? = w, e741, and then the Wronskian is given by 

At 


Wi = =n(o'y + oy) — omy, > Wr=v'y?. 


Wi = Wo Ee 


This is a nonzero function, therefore the functions y, and y2 = vy, are linearly independent. 
This establishes the Theorem. 


Example 2.2.8. Find a second solution y, linearly independent to the solution y,(t) = t of 
the differential equation 
t?y” + Qty’ — 2y = 0. 
Solution: We look for a solution of the form y(t) = t v(t). This implies that 
y, =tv' +0, yy =tu" +20’. 

So, the equation for v is given by 

O0=t7?(tv" + 2v') + 2¢(tv’ + v) — tv 

= ty” + (2t? + 2t7) vo! + (2t — 2t) uv 


=fu"+(47)0 => vo" 4+ my =0. 
Notice that this last equation is precisely Eq. (??), since i our case we have 
yi =t, p(t) = 2 => tu + [24 Sd =0. 
The equation for v is a first order equation for w = v’, given by 
w! 4 


—=-—- = w(t)=«t*, c, ER. 
w t 


Therefore, integrating once again we obtain that 


v=ot 3 +¢s, C2,C3 € R, 
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and recalling that y. = tv we then conclude that 
Yo = cot? + Cst. 


Choosing c, = 1 and cs = 0 we obtain that y(t) = t~?. Therefore, a fundamental solution 
set to the original differential equation is given by 


wO=t wll) =. 
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2.2.4. Exercises. 


2.2.1.- Consider the differential equation 
Py’ +3ty’-3=0, t>0, 


with initial conditions 
y(1)=3, y(l)=>5. 


(a) Find the initial value problem sat- 
isfied by v(t) = y'(t). 

(b) Solve the differential equation for v. 

(c) Find the solution y of the differen- 
tial equation above. 


2.2.2.- Consider the differential equation 
yy +3(y')? =0, 
with initial conditions 


y(0)=1, y/(0) =5. 

(a) Find the differential equation sat- 
isfied by w(y) = v(t(y)), where 
v(t) =y'(0. 

(b) Find the initial condition satisfied 
by the function w. 

(c) Solve the initial value problem for 
the function w. 

(d) Use the solution w found above to 
set up a first order initial value 
problem for y. 

(e) Find the solution y of the differen- 
tial equation above. 


2.2.3.- Solve the differential equation 


/ 


” y 
Saar 
yr 
with initial conditions 
1 
y0)=1, y'(0) = 5. 
2.2.4.- Use the reduction order method to 
find a second solution y2 to the differ- 
ential equation 


am y +8ty' +12y=0, 
knowing that y:(¢) = 1/¢° is a solution. 
The second solution yz must not contain 
any term proportional to y1. 


2.2.5.- * Use the reduction order method to 
find a solution y2 of the equation 
Py” +2ty’ —6y=0 
knowing that y: = t? is a solution to 
this equation. 
(a) Write y2 = vyi and find yp and yy. 
(b) Find the differential equation for v. 
(c) Solve the differential equation for v 
and find the general solution. 
(d) Choose the simplest function v such 
that yo and yi are fundamental so- 


lutions of the differential equation 
above. 
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2.3. Homogenous Constant Coefficients Equations 


All solutions to a second order linear homogeneous equation can be obtained from any 
pair of nonproportional solutions. This is the main idea given in § 2.1, Theorem 2.1.7. In 
this section we obtain these two linearly independent solutions in the particular case that 
the equation has constant coefficients. Such problem reduces to solve for the roots of a 
degree-two polynomial, the characteristic polynomial. 


2.3.1. The Roots of the Characteristic Polynomial. Thanks to the work done 
in § 2.1 we only need to find two linearly independent solutions to second order linear 
homogeneous equations. Then Theorem 2.1.7 says that every other solution is a linear 
combination of the former two. How do we find any pair of linearly independent solutions? 
Since the equation is so simple, having constant coefficients, we find such solutions by trial 
and error. Here is an example of this idea. 


Example 2.3.1. Find solutions to the equation 


y” + 5y’ + 6y = 0. (2.3.1) 


Solution: We try to find solutions to this equation using simple test functions. For example, 
it is clear that power functions y = t” won’t work, since the equation 


n(n — 1) t-?) + 5nt@-) 4 64" =0 


cannot be satisfied for all t € R. We obtained, instead, a condition on t. This rules out 
power functions. A key insight is to try with a test function having a derivative proportional 
to the original function, 


y(t) =r y(t). 
Such function would be simplified from the equation. For example, we try now with the 
test function y(t) = e”'. If we introduce this function in the differential equation we get 


r2+5r+6)e" = & r?+5r+6=0. 3, 
24 5r+6)e =0 24 5r+6=0 2.3.2 


We have eliminated the exponential and any t dependence from the differential equation, 
and now the equation is a condition on the constant r. So we look for the appropriate values 
of r, which are the roots of a polynomial degree two, 


1 go ae 
i 9 (—5 25 — 24) ~~ 9 (—5 _ 1) {"* = —3. 


We have obtained two different roots, which implies we have two different solutions, 
w(t)=e*", p(t) =e. 


These solutions are not proportional to each other, so the are fundamental solutions to the 
differential equation in (2.3.1). Therefore, Theorem 2.1.7 in § 2.1 implies that we have found 
all possible solutions to the differential equation, and they are given by 


y(t) = ae *+a0™, C1,C2 ER. (2.3.3) 

dq 

From the example above we see that this idea will produce fundamental solutions to 

all constant coefficients homogeneous equations having associated polynomials with two 


different roots. Such polynomial play an important role to find solutions to differential 
equations as the one above, so we give such polynomial a name. 
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Definition 2.3.1. The characteristic polynomial and characteristic equation of the 
second order linear homogeneous equation with constant coefficients 
y” + ay’ + ay = 0, 


are given by 
P(r) =r? tar tao, p(r) = 0. 


As we saw in Example 2.3.1, the roots of the characteristic polynomial are crucial to 
express the solutions of the differential equation above. The characteristic polynomial is a 
second degree polynomial with real coefficients, and the general expression for its roots is 


1 
5 (-a: + Va? - as). 


If the discriminant (a? — 4a,) is positive, zero, or negative, then the roots of p are different 
real numbers, only one real number, or a complex-conjugate pair of complex numbers. For 
each case the solution of the differential equation can be expressed in different forms. 


Theorem 2.3.2 (Constant Coefficients). Ifr+ are the roots of the characteristic polynomial 
to the second order linear homogeneous equation with constant coefficients 


y + ary’ + doy = 0, (2.3.4) 


and if c., c- are arbitrary constants, then the following statements hold true. 


(a) Ifr, Ar_, real or complex, then the general solution of Eq. (2.3.4) is given by 
tone" +e 

(b) Ifr, =r-=1> ER, then the general solution of Eq. (2.8.4) is given by 
Hulh=Gge™ ete, 


Furthermore, given real constants ty, Yo and y,, there is a unique solution to the initial value 
problem given by Eq. (2.8.4) and the initial conditions y(to) = yo and y' (to) = y- 


Remarks: 

(a) The proof is to guess that functions y(t) = e™’ must be solutions for appropriate values of 
the exponent constant r, the latter being roots of the characteristic polynomial. When 
the characteristic polynomial has two different roots, Theorem 2.1.7 says we have all 
solutions. When the root is repeated we use the reduction of order method to find a 
second solution not proportional to the first one. 

(b) At the end of the section we show a proof where we construct the fundamental solutions 
Y1, Yo Without guessing them. We do not need to use Theorem 2.1.7 in this second proof, 
which is based completely in a generalization of the reduction of order method. 


Proof of Theorem 2.3.2: We guess that particular solutions to Eq. 2.3.4 must be expo- 
nential functions of the form y(t) = e”, because the exponential will cancel out from the 
equation and only a condition for r will remain. This is what happens, 


ret tae’ +ae>=0 3S r+arta=O0. 


The second equation says that the appropriate values of the exponent are the root of the 
characteristic polynomial. We now have two cases. If r, # r- then the solutions 


wl)=e™, y(t) =e, 
are linearly independent, so the general solution to the differential equation is 


Yeen(t) = ce"? +e.e"*. 
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If r, = r- = 1, then we have found only one solution y,(t) = e"’, and we need to find a 
second solution not proportional to y,. This is what the reduction of order method is perfect 
for. We write the second solution as 


y-(t)=v(t)y(t) => y(t) =v{t)e™, 
and we put this expression in the differential equation (2.3.4), 
(v" + 2rov’ + ure) ert (v' + Tov) a,e™* + au e™* = 0. 


We cancel the exponential out of the equation and we reorder terms, 


vo” + (2rp + a4) vu! + (r2 + aro +a) v = 0. 


We now need to use that ro is a root of the characteristic polynomial, r? + aro + do = 0, 
so the last term in the equation above vanishes. But we also need to use that the root ro is 
repeated, 
a 1 a 
T= 55 az — 4a) = + => %Wt+a,=0. 
The equation on the right side above implies that the second term in the differential equation 


for v vanishes. So we get that 


vy" =0 > v(t) =cq+ Cot 


and the second solution is y-(t) = (c, + cot) y(t). If we choose the constant c. = 0, the 
function y_ is proportional to y,. So we definitely want c, 4 0. The other constant, c,, only 
adds a term proportional to y,, we can choose it zero. So the simplest choice is c, = 0, 
Cp = 1, and we get the fundamental solutions 


(o=2" wire, 
So the general solution for the repeated root case is 
Vent) =ae™ +ate™, 


The furthermore part follows from solving a 2 x 2 linear system for the unknowns c, and c.. 
The initial conditions for the case r, # r_ are the following, 


Y=ce" +c, Y =T.c.e + 7rc_e™, 
It is not difficult to verify that this system is always solvable and the solutions are 


fe (T-Yo — Y1) — (T+Yo — Y1) 
(rn, =r) erste’ “(ry — r_) er-to" 


The initial conditions for the case r_ = r_ = ro are the following, 


Yo = (4 + ct) e7™, y= ce" +4 r(c, + cto) 7. 
It is also not difficult to verify that this system is always solvable and the solutions are 
y 
a, = Yor tolrode = Ys) = — (ovo = Yo) 
a eroto y on eroto : 


This establishes the Theorem. 


Example 2.3.2. Find the solution y of the initial value problem 
y+ 5y'+6y=0, yO)=1,  y/(0)=-1. 


Solution: We know that the general solution of the differential equation above is 


Ygen(t) = ce 2! 4+ ce, 
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We now find the constants c, and c_ that satisfy the initial conditions above, 
1l=y(0)=ate C, = 2, 
4 => 
—1l=y'(0) = —2c, — 3c. c=-1. 
Therefore, the unique solution to the initial value problem is 


y(t) = 2e-** — e- **, 


Example 2.3.3. Find the general solution ygcn of the differential equation 


2y" — 3y' +y = 0. 


Solution: We look for every solutions of the form y(t) = e”™, where r is solution of the 
characteristic equation 


re=1 
1 y] 
Qr? — 3r+1=0 r=7(3+V9-8) 1 
T= 2 
Therefore, the general solution of the equation above is 
Ygen(t) = ae oF ee. 
dq 
Example 2.3.4. Find the solution to the initial value problem 
5) 
oy" + 6y'+y=0, y(0)=1,  ¥O)=3- 


Solution: The characteristic polynomial is p(r) = 9r? + 6r + 1, with roots given by 


ra = Z(-6+ V6 36) See 


3 
Theorem 2.3.2 says that the general solution has the form 


Yeen(t) = c a tee, 


We need to compute the derivative of the expression above to impose the initial conditions, 
, _ C+ 4/3 EN. 4/3 
Ygen(t) = — 3 € 4e(l=7 le ’ 


then, the initial conditions imply that 


1=y(0) =a, 
5 ! Cy => C= 1, (= 2 
~=—7'(0)=—-t+e 
ie! eee 
So, the solution to the initial value problem above is: y(t) = (1 + 2t) e~'/3. <J 


Example 2.3.5. Find the general solution ygen of the equation 


y” — 2y' + 6y = 0. 


Solution: We first find the roots of the characteristic polynomial, 


1 
r? —2r+6=0 rt 5 (2+ 4 — 24) re =1+iv5. 
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Since the roots of the characteristic polnomial are different, Theorem 2.3.2 says that the 
general solution of the differential equation above, which includes complex-valued solutions, 
can be written as follows, 


Yeon(t) = GeltivV5t +e el-V5t & B EC. 


dq 
2.3.2. Real Solutions for Complex Roots. We study in more detail the solutions 


to the differential equation (2.3.4) in the case that the characteristic polynomial has complex 
roots. Since these roots have the form 


rh= 5 + ; Va? — 4a, 


the roots are complex-valued in the case a? — 4a) < 0. We use the notation 
1 


2 

: . ay az 
re=ati~, with a=-—, = 4/a — —. 
B 2, B=ya5- “ 

The fundamental solutions in Theorem 2.3.2 are the complex-valued functions 
(a+iB)t 


i. =e 
The general solution constructed from these solutions is 
Yeen(t) = G el@t*t +e elo EC. 


This formula for the general solution includes real valued and complex valued solutions. 
But it is not so simple to single out the real valued solutions. Knowing the real valued 
solutions could be important in physical applications. If a physical system is described by a 
differential equation with real coefficients, more often than not one is interested in finding 
real valued solutions. For that reason we now provide a new set of fundamental solutions 
that are real valued. Using real valued fundamental solution is simple to separate all real 
valued solutions from the complex valued ones. 


Theorem 2.3.3 (Real Valued Fundamental Solutions). If the differential equation 
y' +ay' +ay=0, (2.3.5) 


where a1, dp are real constants, has characteristic polynomial with complex roots re = ati 
and complez valued fundamental solutions 


g.(t) = el" (t) = elo *9)E, 


then the equation also has real valued fundamental solutions given by 


y.(t) = e™ cos(Bt), y-(t) = e™ sin(t). 


Proof of Theorem 2.3.3: We start with the complex valued fundamental solutions 
posers, gijaee-rh, 
We take the function y, and we use a property of complex exponentials, 
Gu (t) = clot! = er eft — e (cos(Bt) + isin(Bt)), 


where on the last step we used Euler’s formula e’? = cos(9)+7sin(@). Repeat this calculation 
for y_. we get, 


g.(t) = e™ (cos(6t) +isin(St)), g(t) = e** (cos(St) — isin(ft)). 
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If we recall the superposition property of linear homogeneous equations, Theorem 2.1.5, 
we know that any linear combination of the two solutions above is also a solution of the 
differential equation (2.3.6), in particular the combinations 


1 1 
yf) =5G-(Q+E0), vl) = = - 7). 
A straightforward computation gives 
y(t) =e cos(Bt), _y-(t) = et sin(6t). 


This establishes the Theorem. 


Example 2.3.6. Find the real valued general solution of the equation 


y” — 2y' + 6y = 0. 


Solution: We already found the roots of the characteristic polynomial, but we do it again, 
1 
r? —2r+6=0 rt 5 (2 4 — 24) re =1+iv5. 

So the complex valued fundamental solutions are 


EG =", Ea 


Theorem ?? says that real valued fundamental solutions are given by 
y.(t) =e’ cos(V5t), y-(t) = et sin(V5t). 
So the real valued general solution is given by 
Ygen(t) = (c. cos(V5t) + c-sin(V5t)) e*, a, c ER. 
dq 
Remark: Sometimes it is difficult to remember the formula for real valued solutions. One 
way to obtain those solutions without remembering the formula is to start repeat the proof 


of Theorem 2.3.3. Start with the complex valued solution y, and use the properties of the 
complex exponential, 


g(t) = et tiV5)t — of eV — ef (cos(V/5t) + isin(V5t)). 


The real valued fundamental solutions are the real and imaginary parts in that expression. 


Example 2.3.7. Find real valued fundamental solutions to the equation 


y +2y +6y=0. 


Solution: The roots of the characteristic polynomial p(r) = r? + 2r + 6 are 


1 1 
re = 5[-24 V4— 24] = 5[-2 + V—20] re = —-1Ltiv5. 
These are complex-valued roots, with 
a=-l, B= V5. 


Real-valued fundamental solutions are 


y(t) =e-* cos(V5t), —-ya(t) = e~* sin(V5t). 
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Second order differential equations with 
characteristic polynomials having com- 
plex roots, like the one in this exam- 
ple, describe physical processes related 
to damped oscillations. An example 
from physics is a pendulums with fric- 
tion. <J 


FIGURE 1. Solutions from Ex. 2.3.7. 


Example 2.3.8. Find the real valued general solution of y” + 5y = 0. 


Solution: The characteristic polynomial is p(r) = r? +5, with roots rz = +V5i. In this 
case a = 0, and 8 = V5. Real valued fundamental solutions are 


v(t) =cos(VBt), —_y-(t) = sin(VB2). 
The real valued general solution is 


Yeen(t) = c, cos(W5t) +c. sin(v5t), a,c ER. 


J 


Remark: Physical processes that oscillate in time without dissipation could be described 
by differential equations like the one in this example. 


2.3.3. Constructive Proof of Theorem 2.3.2. We now present an alternative proof 
for Theorem 2.3.2 that does not involve guessing the fundamental solutions of the equation. 
Instead, we construct these solutions using a generalization of the reduction of order method. 


Proof of Theorem 2.3.2: The proof has two main parts: First, we transform the original 
equation into an equation simpler to solve for a new unknown; second, we solve this simpler 
problem. 

In order to transform the problem into a simpler one, we express the solution y as a 
product of two functions, that is, y(t) = u(t)v(t). Choosing v in an appropriate way the 
equation for u will be simpler to solve than the equation for y. Hence, 


y = ww ya=votvu s yl=u'vt2ud’ +0". 


Therefore, Eq. (2.3.4) implies that 


(uly + Qu’! + vu) + a, (u'v + v'u) + ay uv = 0, 


that is, 
fw" + (a + 2") ul + do ule +(v" +a,v')u=0. (2.3.6) 
We now choose the function v such that 
443° 25 Yc (2.3.7) 
v v 2 


We choose a simple solution of this equation, given by 


v(t) = gree 
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Having this expression for v one can compute v’ and uv”, and it is simple to check that 


2 


vw" +a,v' = = v. (2.3.8) 


Introducing the first equation in (2.3.7) and Eq. (2.3.8) into Eq. (2.3.6), and recalling that 
v is non-zero, we obtain the simplified equation for the function u, given by 


2 


u” —ku=0, k= 7 — do. (2.3.9) 


Eq. (2.3.9) for u is simpler than the original equation (2.3.4) for y since in the former there 
is no term with the first derivative of the unknown function. 

In order to solve Eq. (2.3.9) we repeat the idea followed to obtain this equation, that 
is, express function wu as a product of two functions, and solve a simple problem of one of 
the functions. We first consider the harder case, which is when k 4 0. In this case, let us 
express u(t) = eV* w(t). Hence, 


ul = VkeV*wtevVw = ul = keV w+ aVkeVP ww! + eV ww”, 
Therefore, Eq. (2.3.9) for function u implies the following equation for function w 
O=u" —ku=eV" (2Vkw'+w") > wl +2Vkw! =0. 


Only derivatives of w appear in the latter equation, so denoting x(t) = w’(t) we have to 
solve a simple equation 


a =-2Vkx => 2(t) = ae72V*, ro ER. 
Integrating we obtain w as follows, 
/ : —2vkt _y w(t) = “Go it 
renaming ¢; = —a/(2Vk), we obtain 
w(t) = ce 2 Rt +o => ut)= coev Rt + ce VF, 
We then obtain the expression for the solution y = uv, given by 
y(t) = qe 2 VE heel sev Re 
Since k = (a?/4 — ay), the numbers 


rea ptvk re = 5(-a + Vaz — 4a) 


are the roots of the characteristic polynomial 


r? +aprt+ do = 0, 
we can express all solutions of the Eq. (2.3.4) as follows 
y(t) =cqe™ +ce™', k #0. 
Finally, consider the case k = 0. Then, Eq. (2.3.9) is simply given by 
u’=0 => ult)=(q+at) oa€R. 
Then, the solution y to Eq. (2.3.4) in this case is given by 
y(t) = (ce + qt) eM”. 


Since k = 0, the characteristic equation r? +a; r+d = 0 has only one root ry = r_ = —a,/2, 
so the solution y above can be expressed as 


y(t) = (co +t) e™, =a 
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The Furthermore part is the same as in Theorem 2.3.2. This establishes the Theorem. 


Notes. 

(a) In the case that the characteristic polynomial of a differential equation has repeated 
roots there is an interesting argument to guess the solution y-. The idea is to take a 
particular type of limit in solutions of differential equations with complex valued roots. 

Consider the equation in (2.3.4) with a characteristic polynomial having complex 


valued roots given by re = a+i(, with 
2 
ay a? 
a=-—, = 4/a) — —. 
2 P oo a 


Real valued fundamental solutions in this case are given by 

g. = e** cos(Bt), g- = e%* sin(At). 
We now study what happen to these solutions j, and g- in the following limit: The 
variable t is held constant, a is held constant, and 6 —> 0. The last two conditions 
are conditions on the equation coefficients, a,, @). For example, we fix a, and we vary 


dy — a?/4 from above. 
Since cos(Gt) + 1 as 6 — 0 with t fixed, then keeping a fixed too, we obtain 


g.(t) = e*' cos(Bt) —> e*' = y,(t). 
sin(Gt) 


Since a + 1as 6 > 0 with ¢ constant, that is, sin(Gt) > Bt, we conclude that 

p(t) _ sin(Bt) 4, _ sin(6t) 

B B pt 

The calculation above says that the function g-/(3 is close to the function y_(t) = te®* in 
the limit @ — 0, t held constant. This calculation provides a candidate, y-(t) = ty.(t), 
of a solution to Eq. (2.3.4). It is simple to verify that this candidate is in fact solution of 
Eq. (2.3.4). Since y_ is not proportional to y,, one then concludes the functions y,, y- are 
a fundamental set for the differential equation in (2.3.4) in the case the characteristic 
polynomial has repeated roots. 


te? —+ te**= y(t). 


(b) Brief Review of Complex Numbers. 
e Complex numbers have the form z = a+ ib, where i? = —1. 
e The complex conjugate of z is the number 7 = a — 1b. 
e Re(z) =a, Im(z) = bd are the real and imaginary parts of z 
e Hence: Re(z) = . and Im(z) = sea 


e The exponential of a complex number is defined as 


co * 
atib __ (a =F ib)” 
ett yeti 
n=0 
In particular holds e = ete. 
e Euler’s formula: e’ = cos(b) + isin(b). 
e Hence, a complex number of the form e 


eatid _ ea (cos(b) ie isin(b)), er — oe (cos(b) - isin(b)). 
a+ib 


a+ib 


a+ can be written as 


e Frome and e*—* we get the real numbers 
1 ‘ ; 1 ' : 
5 (eat + e%—*) = e* cos(b), oF (ett — 4%) = e% sin(b). 
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2.3.4. Exercises. 


2.3.1.- Consider the differential equation 
y — Ty’ +12y = 0. 

(a) Find the roots rz of the characteris- 
tic polynomial associated with the 
differential equation. 

(b) Use the roots rz above to find fun- 
damental solutions ys of the differ- 
ential equation. 

(c) Solve the differential equation 
above with initial conditions 


y(0)=1, y/(0) =—-1. 
2.3.2.- Consider the differential equation 
y” — 8y' + 25y = 0. 

(a) Find the roots rz of the characteris- 
tic polynomial associated with the 
differential equation. 

(b) Use the roots rz above to find real 
valued fundamental solutions y: of 
the differential equation. 


(c) Solve the differential equation 
above with initial conditions 


y(0)=2, y'(0)=2. 
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2.3.3.- Consider the differential equation 


(a) 


y — 6y' +9y = 0. 
Find the roots rz of the characteris- 
tic polynomial associated with the 
differential equation. 
Use the roots rz above to find real 
valued fundamental solutions ys of 
the differential equation. 
Solve the differential equation 
above with initial conditions 


y(0)=1, yO) =2. 


2.3.4.- * Consider the differential equation 


y — 4y' + 4y =0. 
Find one solution of the form 
yi(t) =e. 
Use the reduction order method 
to find a second solution 


yo(t) = v(t) y(t). 
First find the differential equation 
satisfied by v(t). 
Find all solutions v(t) of the differ- 
ential equation in part (b). 
Choose a function v such that the 
associated solution yz does not con- 
tain any term proportional to y1. 


112 2. SECOND ORDER LINEAR EQUATIONS 


2.4. Euler Equidimensional Equation 


Second order linear equations with variable coefficients are in general more difficult to solve 
than equations with constant coefficients. But the Euler equidimensional equation is an 
exception to this rule. The same ideas we used to solve second order linear equations with 
constant coefficients can be used to solve Euler’s equidimensional equation. Moreover, there 
is a transformation that converts Euler’s equation into a linear equation. 


2.4.1. The Roots of the Indicial Polynomial. The Euler equidimensional equation 
appears, for example, when one solves the two-dimensional Laplace equation in polar coordi- 
nates. This happens if one tries to find the electrostatic potential of a two-dimensional charge 
configuration having circular symmetry. The Euler equation is simple to recongnize—the 
coefficient of each term in the equation is a power of the independent variable that matches 
the order of the derivative in that term. 


Definition 2.4.1. The Euler equidimensional equation for the unknown y with singular 
point at tp € R is given by the equation below, where a, and do are constants, 


=i, a" +a (t—to) y’ tary =0. 


Remarks: 


(a) This equation is also called Cauchy equidimensional equation, Cauchy equation, Cauchy- 
Euler equation, or simply Euler equation. As George Simmons says in [10], “Euler 
studies were so extensive that many mathematicians tried to avoid confusion by naming 
subjects after the person who first studied them after Euler.” 


(b) The equation is called equidimensional because if the variable ¢ has any physical di- 


: ‘ d a 
mensions, then the terms with (t — to) a? for any nonnegative integer n, are actually 
dimensionless. 


(c) The exponential functions y(t) = e”’ are not solutions of the Euler equation. Just 
introduce such a function into the equation, and it is simple to show that there is no 
constant r such that the exponential is solution. 


(d) The particular case tp = 0 is 
Py" +oty' +oy=0. 
We now summarize what is known about solutions of the Euler equation. 
Theorem 2.4.2 (Euler Equation). Consider the Euler equidimensional equation 
(t—t))? y" +a, ¢-—t)y +a,¥=0, i> kh (2.4.1) 


where a1, Go, and ty are real constants, and denote by r. the roots of the indicial polynomial 
pir) =r(r—1) tar +a. 
(a) Ifr. Ar, real or complex, then the general solution of Eq. (2.4.1) is given by 


Yoen(t) =ca(t—t)* +e($—t)”, t£>tb, a, CER. 
(b) Ifr, =r-=1>) ER, then the general solution of Eq. (2.4.1) is given by 
Ygen(t) = ce (t— bo)" +e (6—t))" N(tE-t&), tot, a, c ER. 
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Furthermore, given real constants t, >to, Yo and y1, there is a unique solution to the initial 
value problem given by Eq. (2.4.1) and the initial conditions 


y(ts) = Yor yf (ts) = tr. 


Remark: We have restricted to a domain with t > tp. Similar results hold for t < t). In 
fact one can prove the following: If a solution y has the value y(t — t)) at tt) > 0, then 
the function gy defined as (t — to) = y(—(t — to)), for t — to < 0 is solution of Eq. (2.4.1) 
for t — tp < 0. For this reason the solution for t 4 ty is sometimes written in the literature, 
see [3] § 5.4, as follows, 

Yeua(t) = colt — tol” +elt— tol”, re Xr, 

Yeon (t) = cx |t — to|"° +c. |t—to|” Injt—t|, rm =r-=To. 
However, when solving an initial value problem, we need to pick the domain that contains 


the initial data point t,. This domain will be a subinterval in either (—oo, to) or (to, 00). For 
simplicity, in these notes we choose the domain (tg, 00). 


The proof of this theorem closely follows the ideas to find all solutions of second order 
linear equations with constant coefficients, Theorem 2.3.2, in § 2.3. In that case we found 
fundamental solutions to the differential equation 

y +a,y' tay =0, 
and then we recalled Theorem 2.1.7, which says that any other solution is a linear combina- 
tion of a fundamental solution pair. In the case of constant coefficient equations, we looked 
for fundamental solutions of the form y(t) = e™, where the constant r was a root of the 
characteristic polynomial 
r? +ayr +a) = 0. 
When this polynomial had two different roots, r, 4 r_, we got the fundamental solutions 
~pH=e gH 
When the root was repeated, r. = r- = To, we used the reduction order method to get the 
fundamental solutions 


wp =e™, y(t) te, 
Well, the proof of Theorem 2.4.2 closely follows this proof, replacing the exponential function 
by power functions. 


Proof of Theorem 2.4.2: For simplicity we consider the case t) = 0. The general case 
ty # 0 follows from the case tp = 0 replacing t by (t — t)). So, consider the equation 


Py +aty' +ay=0, t>0. 
We look for solutions of the form y(t) = t”, because power functions have the property that 
yar) = ty=rt. 
A similar property holds for the second derivative, 
yor(r-1)r? => Py’ =rr-1)e. 


When we introduce this function into the Euler equation we get an algebraic equation for r, 


[r(r — 1) +r + ao] t’ =0 r(r—1) +ar+a)=0. 
The constant r must be a root of the indicial polynomial 


pir) =r(r—1) tar + ao. 
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This polynomial is sometimes called the Euler characteristic polynomial. So we have two 
possibilities. If the roots are different, r, 4 r_, we get the fundamental solutions 


y(t) =t™, y(t)=t™. 
If we have a repeated root r, = r- = 1, then one solution is y,(t) = t”. To obtain the second 


solution we use the reduction order method. Since we have one solution to the equation, y,, 
the second solution is 


y-(t) = v(t)y.(t) = y(t) = v(t) t”. 
We need to compute the first two derivatives of y_, 
yi =ryvt™? 1 +u't”, y! =7(ro — 1L)vt™? + Qrou’ to) +" t”. 
We now put these expressions for y-, y/ and y” into the Euler equation, 
e (ro(ro — 1)u fro? Orgy! tot + yl” t”) + ayt (rov fro} py! i) +a,ut™ =0. 
Let us reorder terms in the following way, 
yl gor? 4 (Qry + ay) yl ttt + [ro(rro —1)+aro+ Ao] vt’ =0. 
We now need to recall that rp is both a root of the indicial polynomial, 
Yo(To — 1) + auto +a = 0 
and ro is a repeated root, that is (a, — 1)? = 4ao, hence 


(a: — 1) 
2 
Using these two properties of rp in the Euler equation above, we get the equation for v, 


vl gret? 4 gl eroth — Qo > op" t+’ =0. 


T= 2ro+a,= 1. 


This is a first order equation for w = v’, 
w 
wt+w=0 => (tw)! =0 > wi(t)= Fs 


We now integrate one last time to get function v, 


vas => v(t) = woln(t) + vo. 


So the second solution to the Euler equation in the case of repeated roots is 
y-(t) = (wo In(t) +) t => y(t) = wot” In(t) + v9 y(t). 
It is clear we can choose vp = 0 and w, = 1 to get 
y-(t) = t” In(t). 


We got fundamental solutions for all roots of the indicial polynomial, and their general 
solutions follow from Theorem 2.1.7 in § 2.1. This establishes the Theorem. 


Example 2.4.1. Find the general solution of the Euler equation below for t > 0, 
fy" + 4ty +2y=0. 
Solution: We look for solutions of the form y(t) = t”, which implies that 
iyvijort, Py" @=r(r-t, 
therefore, introducing this function y into the differential equation we obtain 
[r(r — 1) + 4r +2] t” =0 r(r—1)+4r4+2=0. 
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The solutions are computed in the usual way, 
1 r,=-l1 
r?+3r+2=0 m= 5[-3+ 9-8] = { _ 
So the general solution of the differential equation above is given by 


taH=et*+e8. 


Remark: Both fundamental solutions in the example above diverge at t = 0. 


Example 2.4.2. Find the general solution of the Euler equation below for t > 0, 
ty” —3ty'+4y=0. 
Solution: We look for solutions of the form y(t) = ¢”, then the constant r must be solution 
of the Euler characteristic polynomial 
r(r—1)-8r+4=0 r?—4r+4=0 r= T= 2. 


Therefore, the general solution of the Euler equation for t > 0 in this case is given by 


Ygen(t) = c,t? + ct? In(t). 


dq 
Example 2.4.3. Find the general solution of the Euler equation below for t > 0, 
ty” —3ty' +13y =0. 
Solution: We look for solutions of the form y(t) = ¢”, which implies that 
tyart, Py"()=r(r-1?, 
therefore, introducing this function y into the differential equation we obtain 
[r(r — 1) — 8r + 13] #” =0 r(r—1) —38r+13=0. 
The solutions are computed in the usual way, 
1 Tr, =24+ 32 
2 — = —_ = 
r? —4r+13=0 oP sy 36] => ae 
So the general solution of the differential equation above is given by 
tlt) =f) ae gm, (2.4.2) 
dq 


2.4.2. Real Solutions for Complex Roots. We study in more detail the solutions 
to the Euler equation in the case that the indicial polynomial has complex roots. Since 
these roots have the form 

(a,—1) 1 


r= 5 +5 (a, — 1)? — 4ao, 
the roots are complex-valued in the case (py — 1)? — 4q < 0. We use the notation 
—1 — 1)2 
r,=a+iB, with a=), B= a 


The fundamental solutions in Theorem 2.4.2 are the complex-valued functions 
Eat, waa des), 
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The general solution constructed from these solutions is 
Yeon(t) = é, 17) 4 e-) BEC. 


This formula for the general solution includes real valued and complex valued solutions. 
But it is not so simple to single out the real valued solutions. Knowing the real valued 
solutions could be important in physical applications. If a physical system is described by a 
differential equation with real coefficients, more often than not one is interested in finding 
real valued solutions. For that reason we now provide a new set of fundamental solutions 
that are real valued. Using real valued fundamental solution is simple to separate all real 
valued solutions from the complex valued ones. 


Theorem 2.4.3 (Real Valued Fundamental Solutions). If the differential equation 
(t-—%)? y" +a(t—t)y +ay=0, t te; (2.4.3) 


where a1, Qo, to are real constants, has indicial polynomial with complex roots r, = a +18 
and complex valued fundamental solutions for t > to, 


g()=(¢-t)or), F(t) = -t)°™, 
then the equation also has real valued fundamental solutions for t >t. given by 


y.(t) = (t — to)* cos(8 In(t — to)), y-(t) = (t — to)® sin(@ In(t — to)). 


Proof of Theorem 2.4.3: For simplicity consider the case tp) = 0. Take the solutions 
g.(t) = photie) 7-(t) = plo 4B) - 
Rewrite the power function as follows, 
Ga (t) = tOt#F) & go 488 — po elm") = ym iP) =, (t) =e cP O), 
A similar calculation yields 
gh) =e FeO, 

Recall now Euler formula for complex exponentials, e’® = cos(@) + isin(@), then we get 

g.(t) = t* [cos(BIn(t)) + isin(BIn(t))], g(t) = t* [cos(B In(t)) — isin(Z In(¢))]. 
Since 7 and g- are solutions to Eq. (2.4.3), so are the functions 


w(t) = 5 [a(t +a()], walt) = 5: [a(t - (0). 


It is not difficult to see that these functions are 
y.(t) = t* cos(8 In(t)), y-(t) = t* sin( In(¢)). 


To prove the case having t) 4 0, just replace t by (t—t,) on all steps above. This establishes 
the Theorem. 


Example 2.4.4. Find a real-valued general solution of the Euler equation below for t > 0, 


ty” —3ty' +13y=0. 


Solution: The indicial equation is r(r — 1) — 3r + 13 = 0, with solutions 
r?—4r4+13=0 > r.=2431, r=2-—34. 
A complex-valued general solution for t > 0 is, 


Ygen(t) = € P+) 42 10-3) 6 BEC. 
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A real-valued general solution for t > 0 is 
Ygen(t) = cx t? cos(3 In(t)) + c. t? sin(3n(t)), Ga, c ER. 
dq 


2.4.3. Transformation to Constant Coefficients. Theorem 2.4.2 shows that y(t) = 
t=, where r, are roots of the indicial polynomial, are solutions to the Euler equation 


ty” +a,ty’ tay =0, t>0. 


The proof of this theorem is to verify that the power functions y(t) = t" solve the differential 
equation. How did we know we had to try with power functions? One answer could be, this 
is a guess, a lucky one. Another answer could be that the Euler equation can be transformed 
into a constant coefficient equation by a change of the independent variable. 


Theorem 2.4.4 (Transformation to Constant Coefficients). The function y is solution of 
the Euler equidimensional equation 
ty" + arty’ tary =0, t>0 (2.4.4) 
iff the function u(z) = y(t(z)), where t(z) = e*, satisfies the constant coefficients equation 
a+ (a,—1)t+au=0, zER, (2.4.5) 
where y’ = dy/dt and tu = du/dz. 


Remark: The solutions of the constant coefficient equation in (2.4.5) are u(z) = e”**, where 
r, are the roots of the characteristic polynomial of Eq. (2.4.5), 
re + (a, —1)r, + a) = 0, 
that is, r, must be a root of the indicial polynomial of Eq. (2.4.4). 
(a) Consider the case that r, 4 r-. Recalling that y(t) = u(z(t)), and z(t) = In(t), we get 
pO Suge) =e age a) ee. gets, 


(b) Consider the case that r, = r_ =o. Recalling that y(t) = u(z(t)), and z(t) = In(¢), we 
get that y,(¢) = t™, while the second solution is 


y(t) = u(z(t)) = z(t) e? = In(t)e™™©O =In(t)e™@™ = y(t) =In(t)t”. 


Proof of Theorem 2.4.4: Given t > 0, introduce t(z) = e*. Given a function y, let 


u(z) = y(t(z)) = u(z) = y(e*). 
Then, the derivatives of u and y are related by the chain rule, 


= He) = Hey 2 = vee) 


“! = ye) 


so we obtain 
u(z) =ty'(t), 
where we have denoted u = du/dz. The relation for the second derivatives is 


a(2) = “(ty ®) £@ = v'O +0) 2 =v" + vw) 


so we obtain 
ii(z) = y(t) + ty (2). 
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Combining the equations for & and t% we get 
Py" =i-4, ty’ =u. 
The function y is solution of the Euler equation t? y” + a,t y’ + ao y = 0 iff holds 
ti-t+at+au=0 > t+(a,-lt+au=0. 
This establishes the Theorem. 


2.4.4. Exercises. 


2.4.1.- . 
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2.4.2.- . 
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2.5. Nonhomogeneous Equations 


All solutions of a linear homogeneous equation can be obtained from only two solutions that 
are linearly independent, called fundamental solutions. Every other solution is a linear com- 
bination of these two. This is the general solution formula for homogeneous equations, and it 
is the main result in § 2.1, Theorem 2.1.7. This result is not longer true for nonhomogeneous 
equations. The superposition property, Theorem 2.1.5, which played an important part to 
get the general solution formula for homogeneous equations, is not true for nonhomogeneous 
equations. 

We start this section proving a general solution formula for nonhomogeneous equations. 
We show that all the solutions of the nonhomogeneous equation are a translation by a fixed 
function of the solutions of the homogeneous equation. The fixed function is one solution— 
it doesn’t matter which one—of the nonhomogenous equation, and it is called a particular 
solution of the nonhomogeneous equation. 

Later in this section we show two different ways to compute the particular solution of a 
nonhomogeneous equation—the undetermined coefficients method and the variation of pa- 
rameters method. In the former method we guess a particular solution from the expression of 
the source in the equation. The guess contains a few unknown constants, the undetermined 
coefficients, that must be determined by the equation. The undetermined method works for 
constant coefficients linear operators and simple source functions. The source functions and 
the associated guessed solutions are collected in a small table. This table is constructed by 
trial and error. In the latter method we have a formula to compute a particular solution 
in terms of the equation source, and fundamental solutions of the homogeneous equation. 
The variation of parameters method works with variable coefficients linear operators and 
general source functions. But the calculations to find the solution are usually not so simple 
as in the undetermined coefficients method. 


2.5.1. The General Solution Formula. The general solution formula for homoge- 
neous equations, Theorem 2.1.7, is no longer true for nonhomogeneous equations. But there 
is a general solution formula for nonhomogeneous equations. Such formula involves three 
functions, two of them are fundamental solutions of the homogeneous equation, and the 
third function is any solution of the nonhomogeneous equation. Every other solution of the 
nonhomogeneous equation can be obtained from these three functions. 


Theorem 2.5.1 (General Solution). Every solution y of the nonhomogeneous equation 


Ly) =f, (2.5.1) 
with L(y) =y" +a,y' +aoy, where ay, do, and f are continuous functions, is given by 
Y= CLY1 + Co Yo + Yp, 
where the functions y,: and yo are fundamental solutions of the homogeneous equation, 


L(y) = 0, L(y) = 0, and yp is any solution of the nonhomogeneous equation L(yp) = f. 


Before we proof Theorem 2.5.1 we state the following definition, which comes naturally 
from this Theorem. 


Definition 2.5.2. The general solution of the nonhomogeneous equation L(y) = f is a 
two-parameter family of functions 


Ygon(t) = C1 ys(t) + C2 Yo(t) + Yp(t), (2.5.2) 


where the functions y, and y, are fundamental solutions of the homogeneous equation, 
L(y) = 0, L(y) = 0, and yp is any solution of the nonhomogeneous equation L(yp) = f. 
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Remark: The difference of any two solutions of the nonhomogeneous equation is actually a 
solution of the homogeneous equation. This is the key idea to prove Theorem 2.5.1. In other 
words, the solutions of the nonhomogeneous equation are a translation by a fixed function, 
Yp, Of the solutions of the homogeneous equation. 


Proof of Theorem 2.5.1: Let y be any solution of the nonhomogeneous equation L(y) = f. 
Recall that we already have one solution, yp, of the nonhomogeneous equation, L(yp) = f. 
We can now subtract the second equation from the first, 


L(y) — L(y) = f—-f=0 L(y — Yp) = 0. 
The equation on the right is obtained from the linearity of the operator L. This last equation 
says that the difference of any two solutions of the nonhomogeneous equation is solution of 
the homogeneous equation. The general solution formula for homogeneous equations says 
that all solutions of the homogeneous equation can be written as linear combinations of a 
pair of fundamental solutions, y;, y2. So the exist constants c,, c, such that 


Y — Yp = C1 Y1 + C2 Yo. 


Since for every y solution of L(y) = f we can find constants c,, c. such that the equation 
above holds true, we have found a formula for all solutions of the nonhomogeneous equation. 
This establishes the Theorem. 


2.5.2. The Undetermined Coefficients Method. The general solution formula 
in (2.5.2) is the most useful if there is a way to find a particular solution y, of the nonho- 
mogeneous equation L(y,) = f. We now present a method to find such particular solution, 
the undetermined coefficients method. This method works for linear operators L with con- 
stant coefficients and for simple source functions f. Here is a summary of the undetermined 
coefficients method: 


(1) Find fundamental solutions y,, y. of the homogeneous equation L(y) = 0. 

(2) Given the source functions f, guess the solutions y, following the Table 1 below. 

(3) If the function y, given by the table satisfies L(y,) = 0, then change the guess to typ. 
If ty satisfies L(ty,) = 0 as well, then change the guess to t?yp. 

(4) Find the undetermined constants k in the function y, using the equation L(y) = f, 
where y is Yp, or typ or t7Yp. 


f(t) (Source) (K, m, a, b, given.) Yp(t) (Guess) (k not given.) 

Ke kes! 

[io ee ee Feigl?® 2x hy 

KK, cos(bt) + K, sin(bt) k, cos(bt) + k sin(bt) 

(Kmt™ +++» + Ko) e* (Kmt™ + +++ + ko) e% 

(K, cos(bt) + K, sin(bt)) e% (k, cos(bt) + kz sin(bt)) e@ 

(Kint™ +++++ Ko)(K, cos(bt) + Ke sin(bt)) || (Kmt™ +++ + ko) (ki cos(bt) + k sin(bt)) 


TABLE 1. List of sources f and solutions y, to the equation L(y,) = f. 
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This is the undetermined coefficients method. It is a set of simple rules to find a 
particular solution y, of an nonhomogeneous equation L(y,) = f in the case that the source 
function f is one of the entries in the Table 1. There are a few formulas in particular cases 
and a few generalizations of the whole method. We discuss them after a few examples. 


Example 2.5.1 (First Guess Right). Find all solutions to the nonhomogeneous equation 


y" = 3y! _ Ay os 3 et. 


Solution: From the problem we get L(y) = y” — 3y’ — 4y and f(t) = 3e”. 

(1): Find fundamental solutions y,, y- to the homogeneous equation L(y) = 0. Since the 

homogeneous equation has constant coefficients we find the characteristic equation 
r—-8r-4=0 > .=4, n=-1, => »y(t)=e", w= =e". 

(2): The table says: For f(t) = 3e7* guess y,(t) = ke?'. The constant k is the undetermined 

coefficient we must find. 


(3): Since y,(t) = ke?’ is not solution of the homogeneous equation, we do not need to 
modify our guess. (Recall: L(y) = 0 iff exist constants c,, c. such that y(t) = c, e4# +c e~'.) 


(4): Introduce y, into L(y,) = f and find k. So we do that, 
(2? -6—4)ke*=3e% 3 -6k=3 => k= =~ 


We guessed that y, must be proportional to the exponential e”’ in order to cancel out the 
exponentials in the equation above. We have obtained that 


if 
yp(t) = me ev 


The undetermined coefficients method gives us a way to compute a particular solution y, of 
the nonhomogeneous equation. We now use the general solution theorem, Theorem 2.5.1, 
to write the general solution of the nonhomogeneous equation, 


Remark: The step (4) in Example 2.5.1 is a particular case of the following statement. 


Theorem 2.5.3. Consider the equation L(y) = f, where L(y) = y"+a, y'+ao y has constant 
coefficients and p is its characteristic polynomial. If the source function is f(t) = K e*, 
with p(a) #0, then a particular solution of the nonhomogeneous equation is 


at 


Proof of Theorem 2.5.3: Since the linear operator L has constant coefficients, let us 
write L and its associated characteristic polynomial p as follows, 


Ly) =y" +ay +ay, p(r) =r? +ayr tao. 


Since the source function is f(£) = K e“, the Table 1 says that a good guess for a particular 
soution of the nonhomogneous equation is yp(t) = ke*’. Our hypothesis is that this guess 
is not solution of the homogenoeus equation, since 


L(yp) = (a? + aya + ao) ke™ = p(a)ke*, and p(a) #0. 
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We then compute the constant k using the equation L(y,) = f, 


k 
a+aata)ke™=Ke™ = pla)ke =Ke* 3S k= —~. 
p(a) 


K 
We get the particular solution y,(t) = a e*t. This establishes the Theorem. 
p(a 


Remark: As we said, the step (4) in Example 2.5.1 is a particular case of Theorem 2.5.3, 
3 3 2t 3 ot 1 x 
= — => t)=-= . 

(@2—6-4)° 6° ult) =—5e 


In the following example our first guess for a particular solution y, happens to be a 
solution of the homogenous equation. 


Example 2.5.2 (First Guess Wrong). Find all solutions to the nonhomogeneous equation 


y” — 8y’ — 4y = 3e*. 


Solution: If we write the equation as L(y) = f, with f(t) = 3e*, then the operator L is 
the same as in Example 2.5.1. So the solutions of the homogeneous equation L(y) = 0, are 
the same as in that example, 


y(t)=e*, y(t) =e". 


The source function is f(t) = 3e*", so the Table 1 says that we need to guess yp(t) = ke*. 
However, this function y, is solution of the homogeneous equation, because 


Yo=ky => L(y) =0. 
We have to change our guess, as indicated in the undetermined coefficients method, step (3) 
yp (t= kie™. 


This new guess is not solution of the homogeneous equation. So we proceed to compute the 
constant k. We introduce the guess into L(y,) = f, 


yp = (1+ 4t) ke*, yp, = (8 + 16t) ke™* => [8-3+(16-12-4)é] ke =3e*, 
therefore, we get that 
3 3 


k k= t)= te. 
5k =3 > yp(t) gte 


The general solution theorem for nonhomogneneous equations says that 


In the following example the equation source is a trigonometric function. 


Example 2.5.3 (First Guess Right). Find all the solutions to the nonhomogeneous equation 
y” — 3y’ — 4y = 2sin(t). 
Solution: If we write the equation as L(y) = f, with f(t) = 2sin(t), then the operator L 


is the same as in Example 2.5.1. So the solutions of the homogeneous equation L(y) = 0, 
are the same as in that example, 
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Since the source function is f(t) = 2sin(t), the Table 1 says that we need to choose the 
function y,(t) = k, cos(t) + ky sin(t). This function y, is not solution to the homogeneous 
equation. So we look for the constants k,, k, using the differential equation, 


Yp = —hy sin(t) + kz cos(t), y}, = —ky cos(t) — ky sin(t), 
and then we obtain 
[—k, cos(t) — ky sin(t)] — 3[—k, sin(t) + k, cos(t)] — 4[k, cos(t) + kp sin(t)] = 2 sin(t). 
Reordering terms in the expression above we get 
(—5k, — 3k,) cos(t) + (3k, — 5k,) sin(t) = 2sin(t). 


The last equation must hold for all ¢ € R. In particular, it must hold for t = 7/2 and for 
t = 0. At these two points we obtain, respectively, 


3k, — 5k. = 2, - ki = Ta 
—5k, — 3k, = 0, 


So the particular solution to the nonhomogeneous equation is given by 


Yp(t) = 7 [3 cos(t) — 5sin(t)]. 


The general solution theorem for nonhomogeneous equations implies 
Ygen(t) =c,e"* +ce-e + - [3 cos(t) — 5 sin(t)]. 
dq 
The next example collects a few nonhomogeneous equations and the guessed function 
Yp- 


Example 2.5.4. We provide few more examples of nonhomogeneous equations and the 
appropriate guesses for the particular solutions. 


(a) For y” — 3y' — 4y = 3e”‘ sin(t), guess, yp(t) = [k, cos(t) + k, sin(t)] e?*. 
(b) For y” — 3y’ — 4y = 2t? e?, guess, yp(t) = (kot? + kit + ko) e*. 


) 
) 

(c) For y” — 3y’ — 4y = 2t? e*, guess, yp(t) = (kat? + kyt+ ko) te. 
) 


(d) For y” — 3y/ — 4y = 3t sin(t), guess, y,(t) = (kit + ko) [ky cos(t) + kp sin(t)]. 
dq 


Remark: Suppose that the source function f does not appear in Table 1, but f can be 
written as f = f,+ fo, with f, and f, in the table. In such case look for a particular solution 
Yp = Yp1 + Yp2, Where L(yp,) = f, and L(yp,) = fo. Since the operator L is linear, 


L(Yp) = L(Yp, ote Yp2) = L(Yp,) aT L(Yp2) HSfth=f => L(Yp) =f. 


Example 2.5.5. Find all solutions to the nonhomogeneous equation 


y” — 3y' — 4y = 3e* + Qsin(t). 
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Solution: If we write the equation as L(y) = f, with f(t) = 2sin(t), then the operator L 
is the same as in Example 2.5.1 and 2.5.3. So the solutions of the homogeneous equation 
L(y) = 0, are the same as in these examples, 


y(t)=e", y(t) =e. 
The source function f(t) = 3e7’ + 2sin(t) does not appear in Table 1, but each term does, 
fi(t) = 3e”' and f,(t) = 2sin(t). So we look for a particular solution of the form 
Yp = Vp, +Yp,, Where L(yp,)=3e", L(yp,) = 2sin(t). 


We have chosen this example because we have solved each one of these equations before, in 
Example 2.5.1 and 2.5.3. We found the solutions 


1 1 , 
Yn (t) === Ee, Yoo(t) = rT (3 cos(t) — 5sin(t)). 
Therefore, the particular solution for the equation in this example is 
1 1 
yp(t) = si et + 7 (3cos(t) — 5sin(¢)). 
Using the general solution theorem for nonhomogeneous equations we obtain 


1 1 
Ygen(t) = c,e"* + ce * e*' + — (3cos(t) — 5sin(t)). 
2 il 
a4) 


2.5.3. The Variation of Parameters Method. This method provides a second way 
to find a particular solution y, to a nonhomogeneous equation L(y) = f. We summarize 
this method in formula to compute y, in terms of any pair of fundamental solutions to the 
homogeneous equation L(y) = 0. The variation of parameters method works with second 
order linear equations having variable coefficients and contiuous but otherwise arbitrary 
sources. When the source function of a nonhomogeneous equation is simple enough to 
appear in Table 1 the undetermined coefficients method is a quick way to find a particular 
solution to the equation. When the source is more complicated, one usually turns to the 
variation of parameters method, with its more involved formula for a particular solution. 


Theorem 2.5.4 (Variation of Parameters). A particular solution to the equation 
Ly) =f, 
with L(y) = y” + a(t) y’ + ao(t) y and ay, do, f continuous functions, is given by 
Yp = U1Y1 + U2Ya; 


where Y1, Y2 are fundamental solutions of the homogeneous equatio L(y) = 0 and the func- 
tions Uz, Uz are defined by 


yo(t) f(t) 
. Wyrys (t) "7 


where Wy,y. is the Wronskian of y: and Yo. 


_ fw FO | 


id = | Wry 


Ua(t) a (2.5.3) 


The proof is a generalization of the reduction order method. Recall that the reduction 
order method is a way to find a second solution y2 of an homogeneous equation if we already 
know one solution y,. One writes y, = uy, and the original equation L(y.) = 0 provides an 
equation for u. This equation for u is simpler than the original equation for yy because the 
function y, satisfies L(y,) = 0. 
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The formula for y, can be seen as a generalization of the reduction order method. We 
write y, in terms of both fundamental solutions y,, y. of the homogeneous equation, 


Yp(t) = us(t) yr(t) + Ua(t) walt). 
We put this y, in the equation L(y,) = f and we find an equation relating wu; and uy. It 
is important to realize that we have added one new function to the original problem. The 
original problem is to find y,. Now we need to find u; and uz, but we still have only one 
equation to solve, L(y,) = f. The problem for u,, uw, cannot have a unique solution. So we 
are completely free to add a second equation to the original equation L(y,) = f. We choose 
the second equation so that we can solve for u,; and ws. 


Proof of Theorem 2.5.4: Motivated by the reduction of order method we look for a yp 
Yp = U1 Y1 + Us Yr. 


We hope that the equations for u,, uz will be simpler to solve than the equation for y,. But 
we started with one unknown function and now we have two unknown functions. So we are 
free to add one more equation to fix u,, u2. We choose 


U4 Ya + Uy Yo = 0. 


/ 
In other words, we choose uz = [-3 u; dt. Let’s put this y, into L(yp) = f. We need y/, 


/ 
2 


(and recall, ul, ys + us Yo = 0) 
Yp = Uy Yr FU Yi + Uy Yo + U2Ye >  Yy = UY, + UY. 
and we also need y/,, 
Yp = UY, + Un yy + Uy Ys + U2 Yo- 
So the equation L(y,) = f is 


(uy Yr + Ur Yy + UD YD + Us Ys) + Gi(Ur Yy + U2 YQ) + Go(Us Yr + Us Yo) = f 
We reorder a few terms and we see that 
Us Yh + Uy Yo + Us (yy! + a Yi + Go Yr) + Us (Ye + As Yo + Mo Yo) = fF. 
The functions y, and y, are solutions to the homogeneous equation, 
yy + a1 yf, + do y1 = 0, Ys +41 Y5 + do Yo = 0, 
so u, and uz must be solution of a simpler equation that the one above, given by 
UY + UZ ys = f. (2.5.4) 
So we end with the equations 
Uy Yh + Uz Ye = f 


Us Ys + Uy Yo = 0. 


And this is a 2 x 2 algebraic linear system for the unknowns uj, uj. It is hard to overstate 
the importance of the word “algebraic” in the previous sentence. From the second equation 
above we compute us, and we introduce it in the first equation, 


/ hap / 
p=-B ul > uy - Bu ap = of (Be) =F 


U 1 
Yo Yo Yo 
Recall that the Wronskian of two functions is Wi. = yy} — yi Yo, we get 
u, — Yof = Pi wh 


Uy = : 
Wi Wi 
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These equations are the derivative of Eq. (2.5.3). Integrate them in the variable t and choose 
the integration constants to be zero. We get Eq. (2.5.3). This establishes the Theorem. 


Remark: The integration constants in the expressions for u,, U2. can always be chosen to 
be zero. To understand the effect of the integration constants in the function yp, let us do 
the following. Denote by u, and uz the functions in Eq. (2.5.3), and given any real numbers 
Cc, and cy define 


Uy = U1. + Ci, Up = Up + Co. 
Then the corresponding solution J is given by 
Up = Uy Yr + Ue Yo = Ut Ya + U2 Yo FOr Yr + C2Yo = Up = Vp FLY + Yr- 


The two solutions y, and y, differ by a solution to the homogeneous differential equation. 
So both functions are also solution to the nonhomogeneous equation. One is then free to 
choose the constants c, and c, in any way. We chose them in the proof above to be zero. 


Example 2.5.6. Find the general solution of the nonhomogeneous equation 
y” — By’ + by = 2e’. 
Solution: The formula for y, in Theorem 2.5.4 requires we know fundamental solutions to 


the homogeneous problem. So we start finding these solutions first. Since the equation has 
constant coefficients, we compute the characteristic equation, 


r?—5r+6=0 re = =(54V25—-24) = { 


r, = 3, 


r_=2 
So, the functions y, and y, in Theorem 2.5.4 are in our case given by 
w(t)=e", — ya(t) =e. 
The Wronskian of these two functions is given by 
Wan b= Ce) —-Ge ye) =| Wit) =—e™ 


We are now ready to compute the functions u, and u2. Notice that Eq. (2.5.3) the following 
differential equations 


fe Yof Po, wf 
ul=— ; 1 ; 
Was yo Wrryp 
So, the equation for u, is the following, 
uae \(-e ™ = wa2e" => wo=-e™, 
mee Cee = weaete* = ap2=—22", 


where we have chosen the constant of integration to be zero. The particular solution we are 
looking for is given by 


Yp = (—e*)(e%) + (Zee) > op =e’. 
Then, the general solution theorem for nonhomogeneous equation implies 


Yeen(t) = ey epee” te c.,c. ER. 
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Example 2.5.7. Find a particular solution to the differential equation 
Py" — 2y = 30 —1, 
knowing that y, = t? and y, = 1/t are solutions to the homogeneous equation t7y’’ — 2y = 0. 


Solution: We first rewrite the nonhomogeneous equation above in the form given in Theo- 
rem 2.5.4. In this case we must divide the whole equation by t?, 


2 1 1 
ee “g > f(t) =3— 5. 
We now proceed to compute the Wronskian of the fundamental solutions y,, y2, 
-1 1 
Wrsalt) = ()(=)-@0(5) > Walt) =-3 

We now use the equation in (2.5.3) to obtain the functions u; and us, 

1 1) 1 1\ 1 

Pa ah) ae 5=()(8-s)S 

vs a pee = (6) (3- @) 3 
1 1 1 1 1 1 

a = In(t)+ 17 =-fM+. => == + St. 

t 3 ty os ae ae 


A particular solution to the nonhomogeneous equation above is jp) = Uz,Y, + Usy2, that: is, 


18 +#)(¢-") 


é 1 
ip = [Inte + gt] 0) + 5 
1 1 1 
=f In(t)+—2—--P 42 
tIn(t) + & 3t 5 
1 1 
=Uini)+5-3¢ 
1 1 


However, a simpler expression for a solution of the nonhomogeneous equation above is 


il 
Yp =U In(t) + - 


<q 


Remark: Sometimes it could be difficult to remember the formulas for functions wu; and ug 
in (2.5.3). In such case one can always go back to the place in the proof of Theorem 2.5.4 
where these formulas come from, the system 


roe Pop 
UY, + UY, = f 


UY + UsYo = 0. 


The system above could be simpler to remember than the equations in (2.5.3). We end this 
Section using the equations above to solve the problem in Example 2.5.7. Recall that the 
solutions to the homogeneous equation in Example 2.5.7 are y,(t) = ¢?, and y(t) = 1/t, 
while the source function is f(t) = 3—1/t?. Then, we need to solve the system 


i 
Puy tuys =O, 
1 1 
atu! tut ) 3- = 
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This is an algebraic linear system for u’, and u),. Those are simple to solve. From the equation 
on top we get uj, in terms of uw, and we use that expression on the bottom equation, 


1 1 1 
Substitue back the expression for u/, in the first equation above and we get us. We get, 
oe 
eB 
1 
w= P+. 
2 3 


We should now integrate these functions to get u; and uz and then get the particular solution 
Yp = UiY1 + Usy2. We do not repeat these calculations, since they are done Example 2.5.7. 


2.5.4. Exercises. 


2.5.1.- . 
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2.5.2.- . 
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2.6. Applications 


Different physical systems are mathematically identical. In this Section we show that 
a weight attached to a spring, oscillating either in air or under water, is mathematically 
identical to the behavior of an electric current in a circuit containing a resistance, a capacitor, 
and an inductance. Mathematical identical means that both systems are described by the 
same differential equation. 


2.6.1. Review of Constant Coefficient Equations. In Section 2.3 we have seen 
how to find solutions to second order, linear, constant coefficient, homogeneous, differential 
equations, 


y +ay tay=0, a4,a, ER. (2.6.1) 


Theorem 2.3.2 provides formulas for the general solution of this equation. We review here 
this result, and at the same time we introduce new names describing these solutions, names 
that are common in the physics literature. The first step to obtain solutions to Eq. (2.6.1) 
is to find the roots or the characteristic polynomial p(r) = r? + a,r + do, which are given by 


rh= + a? 


= 2 2 : 


We then have three different cases to consider. 


. . : 2 = ee . . 
0 : 
(a) A system is over damped in the case that aj — 4a, > 0. In this case the characteristic 
polynomial has real and distinct roots, r,, r-, and the corresponding solutions to the 
differential equation are 


HH=e°%. war. 


So the solutions are exponentials, increasing or decreasing, according whether the roots 
are positive or negative, respectively. The decreasing exponential solutions originate the 
name over damped solutions. 

(b) A system is critically damped in the case that a?—4a) = 0. In this case the characteristic 
polynomial has only one real, repeated, root, * = —a,/2, and the corresponding solutions 
to the differential equation are then, 


y(t) -_ em yo(t) = te ut/2, 


(c) A system is under damped in the case that a? — 4a, < 0. In this case the characteristic 
polynomial has two complex roots, r+ = a+(i, where one root is the complex conjugate 
of the other, since the polynomial has real coefficients. The corresponding solutions to 
the differential equation are 


y(t) =e™ cos(Gt), —-ya(t) = e** sin(Bt). 
1 
where @ = me and 6 = 3 V Ado —a?. 


(d) A system is wundamped when is under pamper with a, = 0. Therefore, the characteristic 
polynomial has two pure imaginary roots r, = +,/a,. The corresponding solutions are 
oscillatory functions, 


Yys(t) = cos(wot), yo(t) = sin(wot). 


where Wp = ,/do- 
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2.6.2. Undamped Mechanical Oscillations. Springs are curious objects, when you 
slightly deform them they create a force proportional and in opposite direction to the de- 
formation. When you release the spring, it goes back to its original size. This is true for 
small enough deformations. If you stretch the spring long enough, the deformations are 
permanent. 


Definition 2.6.1. A spring is an object that when deformed by an amount Al creates a 
force F, = —k Al, with k > 0. 


Consider a spring-body system as shown in Fig. 2.6.2. A spring is fixed to a ceiling and 
hangs vertically with a natural length /. It stretches by Al when a body with mass m is 
attached to its lower end, just as in the middle spring in Fig. 2.6.2. We assume that the 
weight m is small enough so that the spring is not damaged. This means that the spring acts 
like a normal spring, whenever it is deformed by an amount Al it makes a force proportional 
and opposite to the deformation, 

Foo = —k Al. 


Here k > 0 is a constant that depends on the type of spring. Newton’s law of motion imply 
the following result. 


Theorem 2.6.2. A spring-body system with spring constant k, body mass m, at rest with 
a spring deformation Al, within the rage where the spring acts like a spring, satisfies 


mg =kAI. 


Proof of Theorem 2.6.2: Since the spring-body system is at rest, Newton’s law of motion 
imply that all forces acting on the body must add up to zero. The only two forces acting on 
the body are its weight, Fz = mg, and the force done by the spring, F's5 = —k Al. We have 
used the hypothesis that Al is small enough so the spring is not damaged. We are using 
the sign convention displayed in Fig. 2.6.2, where forces pointing downwards are positive. 


As we said above, since the body is at rest, 
the addition of all forces acting on the body 
must vanish, 


Fy + Fso = 0 mg =kAl. 
This establishes the Theorem. 


Remark: Rewriting the equation above as 
mg 
k=. 
Al 
it is possible to compute the spring constant k 
by measuring the displacement Al and know- 


ing the body mass m. 


FIGURE 2. Springs with weights. 


We now find out how the body will move when we take it away from the rest position. 
To describe that movement we introduce a vertical coordinate for the displacements, y, as 
shown in Fig. 2.6.2, with y positive downwards, and y = 0 at the rest position of the spring 
and the body. The physical system we want to describe is simple; we further stretch the 


2.6. APPLICATIONS 133 


spring with the body by yo and then we release it with an initial velocity vp. Newton’s law 
of motion determine the subsequent motion. 


Theorem 2.6.3. The vertical movement of a spring-body system in air with spring constant 
k > 0 and body mass m > 0 ts described by the solutions of the differential equation 
my" +ky=0, (2.6.2) 


where y is the vertical displacement function as shown in Fig. 2.6.2. Furthermore, there is 
a unique solution to Eq. (2.6.2) satisfying the initial conditions y(0) = yo and y’(0) = vp, 


y(t) = A cos(wot — 4), 


Lk 
with angular frequency wo = 4/—, where the amplitude A > 0 and phase-shift ¢ € (—7, 7], 
m 


2 ue Vo 
A= 4/yet+—) = arctan( : 
Wo WoYo 


Remark: The angular or circular frequency of the system is wy) = \/k/m, meaning that 
the motion of the system is periodic with period given by T = 27/w», which in turns implies 
that the system frequency is Vp = wo/(27). 


Proof of Theorem 2.6.3: Newton’s second law of motion says that mass times acceleration 
of the body my’’(t) must be equal to the sum of all forces acting on the body, hence 


my (t) = F, + Feo + F(t), 


where F(t) = —k y(t) is the force done by the spring due to the extra displacement y. 
Since the first two terms on the right hand side above cancel out, Fy + Fso = 0, the body 
displacement from the equilibrium position, y(t), must be solution of the differential equation 


my (t)+ky(t) =0. 


which is Eq. (2.6.2). In Section ?? we have seen how to solve this type of differential 
equations. The characteristic polynomial is p(r) = mr? +k, which has complex roots 


r4 = +w?i, where we introduced the angular or circular frequency of the system, 
k 
Wy = 4/—. 
m 


The reason for this name is the calculations done in Section ??, where we found that a 
real-valued expression for the general solution to Eq. (2.6.2) is given by 


Yeen(t) = cy cos(wot) + Cc, sin(wot). 


This means that the body attached to the spring oscillates around the equilibrium position 
y = 0 with period T = 27/w», hence frequency 1 = wo/(27). There is an equivalent way 
to express the general solution above given by 


Ygen(t) = A cos(wot — @). 
These two expressions for Ygen are equivalent because of the trigonometric identity 
Acos(wot — ¢) = Acos(wot) cos(¢) + A sin(wot) sin(¢), 
which holds for all A and ¢, and wot. Then, it is not difficult to see that 
aarti A=Va+c, 
> 


Co = Asin(@). o= arctan(=). 
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Since both expressions for the general solution are equivalent, we use the second one, in 
terms of the amplitude and phase-shift. The initial conditions y(0) = y and y’(0) = %} 
determine the constants A and ¢. Indeed, 


2 
Yo = y(0) = A cos(¢), = A = Ye am we 
Up = y' (0) = Aw sin(d) Uo 
o= arctan ( ) : 
WoYo 


This establishes the Theorem. 


Example 2.6.1. Find the movement of a 50 gr mass attached to a spring moving in air 
with initial conditions y(0) = 4cm and y'(0) = 40cm/s. The spring is such that a 30 gr 
mass stretches it 6cm. Approximate the acceleration of gravity by 1000 cm/s?. 


Solution: Theorem 2.6.3 says that the equation satisfied by the displacement y is given by 
my” + ky = 0. 

In order to solve this equation we need to find the spring constant, k, which by Theorem 2.6.2 
is given by k = mg/Al. In our case when a mass of m = 30 gr is attached to the sprint, it 
stretches Al = 6cm, so we get, 

(30) (1000) 

6 

Knowing the spring constant k we can now describe the movement of the body with mass 


m = 50gr. The solution of the differential equation above is obtained as usual, first find the 
roots of the characteristic polynomial 


[k | 1 
mr +k=0 TE =twyot, wo =4f/—= x > w=10-. 
m 50 s 


We write down the general solution in terms of the amplitude A and phase-shift ¢, 
y(t) = A cos(wt—¢) => y(t)=Acos(l0t— ¢). 

To accommodate the initial conditions we need the function y'(t) = —Awy sin(w ot — ¢). The 
initial conditions determine the amplitude and phase-shift, as follows, 
(0) = Acos(¢), rane Fe 
: = o = arcta (~. ) 

= ar n : 

(10)(4) 

We obtain that A = 4\/2 and tan(#) = 1. The later equation implies that either ¢ = 7/4 or 
@ = —3n/4, for @ € (—7,7]. If we pick the second value, ¢ = —37/4, this would imply that 


y(0) < 0 and y’(0) < 0, which is not true in our case. So we must pick the value ¢ = 7/4. 
We then conclude: 


k= 


k = 5000 5. 
Ss 


y(t) = 4V2 cos(10t — *). 


J 


2.6.3. Damped Mechanical Oscillations. Suppose now that the body in the spring- 
body system is a thin square sheet of metal. If the main surface of the sheet is perpendicular 
to the direction of motion, then the air dragged by the sheet during the spring oscillations will 
be significant enough to slow down the spring oscillations in an appreciable time. One can 
find out that the friction force done by the air opposes the movement and it is proportional 
to the velocity of the body, that is, Fg = —dy’(t). We call such force a damping force, where 
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d > 0 is the damping coefficient, and systems having such force damped systems. We now 
describe the spring-body system in the case that there is a non-zero damping force. 


Theorem 2.6.4. 


(a) The vertical displacement y, function as shown in Fig. 2.6.2, of a spring-body system 
with spring constant k > 0, body mass m > 0, and damping constant d > 0, is described 
by the solutions of 


my’ +dy +ky=0, (2.6.3) 
(b) The roots of the characteristic polynomial of Eq. (2.6.3) are r- = —wa + \/w4 — wG, 
d k 
with damping coefficient wa = in and circular frequency Wy = 4/—. 
m m 


(c) The solutions to Eq. (2.6.3) fall into one of the following cases: 
(i) A system with wg > wo is called over damped, with general solution to Eq. (2.6.3) 


y(t) =ce™ +c.e™. 


Ss 


(i 


A system with wa = wo is called critically damped, with general solution to Eq. (2.6.8) 
yi) =ce "f+ ete 


(iti) A system with wa < We is called under damped, with general solution to Eq. (2.6.3) 
y(t) = Ae~**" cos(t — 4), 


where B = \/w? — wi. 


(d) There is a unique solution to Eq. (2.6.2) with initial conditions y(0) = yo and y'(0) = vo. 


Remark: In the case the damping coefficient vanishes we recover Theorem 2.6.3. 


Proof of Therorem 2.6.3: Newton’s second law of motion says that mass times acceler- 
ation of the body my”(t) must be equal to the sum of all forces acting on the body. In the 
case that we take into account the air dragging force we have 


my" (t) = Fy + Fso + Fs(t) + Fa(t), 


where F(t) = —k y(t) as in Theorem 2.6.3, and Fy(t) = —dy’(t) is the air -body dragging 
force. Since the first two terms on the right hand side above cancel out, Fy + Fso = 0, 
as mentioned in Theorem 2.6.2, the body displacement from the equilibrium position, y(t), 
must be solution of the differential equation 


my" (t)++dy'(t) +k y(t) =0. 


which is Eq. (2.6.3). In Section ?? we have seen how to solve this type of differential 
equations. The characteristic polynomial is p(r) = mr? + dr +k, which has complex roots 
d d 7 k 7 5 


r= —— [-d + Vd? — 4mk] Sa ( Te = Wd © \/ Wg — Wo. 


2m m 


d lk 
where wg = am and wy) = ,/—. In Section ?? we found that the general solution of a 
m m 


differential equation with a characteristic polynomial having roots as above can be divided 
into three groups. For the case r, 4 r- real valued, we obtain case (ci), for the case r, = r- 
we obtain case (cii). Finally, we said that the general solution for the case of two complex 
roots r+ = a+ (i was given by 


yt) = e™ (c; cos(St) + cz sin(£t)). 
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In our case @ = —wq and 8 = \/w? — w4. We now rewrite the second factor on the right-hand 
side above in terms of an amplitude and a phase shift, 
y(t) = Ae “#* cos(Bt — ¢). 


The main result from Section ?? says that the initial value problem in Theorem 2.6.4 has a 
unique solution for each of the three cases above. This establishes the Theorem. 


Example 2.6.2. Find the movement of a 5Kg mass attached to a spring with constant k = 
5Kg/ Secs” moving in a mediuith damping constant d = 5Kg/Secs, with initial conditions 
y(0) = V3 and y'(0) = 0. 

Solution: By Theorem 2.6.4 the differential equation for this system is my” + dy’ + ky = 0, 
with m= 5, k =5, d=5. The roots of the characteristic polynomial are 


/ d 1 Lk 
T+ = Wg w — we, Oe Gy oP Wo = “oe 


tL = Bi l= =a. 
7 2 4 2 2 


This means our system has under damped oscillations. Following Theorem 2.6.4 part (ciii), 
the general solution is given by 


that is, 


y(t) = Ae’? cos(¥2 t— 0). 


We only need to introduce the initial conditions into the expression for y to find out the 
amplitude A and phase-shift ¢. In order to do that we first compute the derivative, 


y(t) = -} Ae" cos( “2 ¢—4) - VB get? sin( 0). 


The initial conditions in the example imply, 
= 1 3 
V3 = y(0) = Acos(¢), 0=y'(0) = = Acos(¢) + ue Asin(@). 
The second equation above allows us to compute the phase-shift, since 


1 T om 
t i ue =. 
If ¢ = —57/6, then y(0) < 0, which is not out case. Hence we must choose ¢ = 7/6. With 
that phase-shift, the amplitude is given by 


V3 = Acos(*) =4¥3 => A=2. 


3 
We conclude: y(t) = 2e~*/? cos (3 t— a Z 


or b=-ma- 


2.6.4. Electrical Oscillations. We describe the electric current flowing through an 
RLC-series electric circuit, which consists of a resistance, a coil, and a capacitor connected 
in series as shown in Fig. 3. A current can be started by approximating a magnet to the coil. 
If the circuit has low resistance, the current will keep flowing through the coil between the 
capacitor plates, endlessly. There is no need of a power source to keep the current flowing. 
The presence of a resistance transforms the current energy into heat, damping the current 
oscillation. 
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This system is described by an _ integro- | | OOO 
differential equation found by Kirchhoff, now VN L 
called Kirchhoff’s voltage law, relating the re- C 


sistor R, capacitor C, inductor L, and the 
current J in a circuit as follows, 


5 fe I(t) = electric current 
L(t) +RI(t) +5 / I(s) ds =0. (2.6.4) 
i. Ficure 3. An RLC circuit. 


Kirchhoff’s voltage law is all we need to present the following result. 


Theorem 2.6.5. The electric current I in an RLC circuit with resistance R > 0, capaci- 
tance C' > 0, and inductance L > 0, satisfies the differential equation 


LI"(t)+RI(t) + a I(t) =0. 


The roots of the characteristic polynomial of Eq. (2.6.3) are rz = —wa + \/w4 — we, with 


1 
damping coefficient wq = x and circular frequency Wo = To’ 
in Theorem 2.6.4 parts (c), (d), hold with wa and wo defined here. 


Furthermore, the results 


Proof of Theorem 2.6.5: Compute the derivate on both sides in Eq. (2.6.4), 


1 
LI"(@)+ RI'(e)+ fa I(t) =0, 
and divide by L, 
I'(t) + 2(<-) (Oe 1H=0 
21 LC ; 
iL 
Introduce wg = a and w)p = —==, then Kirchhoff’s law can be expressed as the second 


2L 
order, homogeneous, constant coefficients, differential equation 


I" + Qual +u2I =0. 


The rest of the proof follows the one of Theorem 2.6.4. This establishes the Theorem. 


Example 2.6.3. Find real-valued fundamental solutions to I” + 2wq I’ + w? I = 0, where 
wq = R/(2L), w? = 1/(LC), in the cases (a), (b) below. 


Solution: The roots of the characteristic polynomial, p(r) = r? + 2war + w?, are given by 
1 
= [—2wa + 4/4w? — wi? re = —wg t 4/w? — v2. 


Case (a): R= 0. This implies wg = 0, so rz = iw. Therefore, 


I,(t) = cos(wot), I(t) = sin(wot). 


Remark: When the circuit has no resistance, the current oscillates without dissipation. 


Case (b): R < ./4L/C. This implies 
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Therefore, the characteristic polynomial has complex roots rp = —wg + i,/w? — w4, hence 
the fundamental solutions are 

I,(t) = e~*** cos(6t), 

L(t) =e °* sin(B 2), 
with 8 = \/w? —w%. Therefore, the resistance R damps the current oscillations produced 
by the capacitor and the inductance. <J 


FIGURE 4. Typical currents [,, I, for case (b). 
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2.6.5. Exercises. 


2.6.1.- . 2.6.2.- . 


CHAPTER 3 


Power Series Solutions 


The first differential equations were solved around the end of the seventeen century and 
beginning of the eighteen century. We studied a few of these equations in § 1.1-1.4 and the 
constant coefficients equations in Chapter 2. By the middle of the eighteen century people 
realized that the methods we learnt in these first sections had reached a dead end. One reason 
was the lack of functions to write the solutions of differential equations. The elementary 
functions we use in calculus, such as polynomials, quotient of polynomials, trigonometric 
functions, exponentials, and logarithms, were simply not enough. People even started to 
think of differential equations as sources to find new functions. It was matter of little time 
before mathematicians started to use power series expansions to find solutions of differential 
equations. Convergent power series define functions far more general than the elementary 
functions from calculus. 

In § 3.1 we study the simplest case, when the power series is centered at a regular 
point of the equation. The coefficients of the equation are analytic functions at regular 
points, in particular continuous. In § ?? we study the Euler equidimensional equation. The 
coefficients of an Euler equation diverge at a particular point in a very specific way. No 
power series are needed to find solutions in this case. In § 3.2 we solve equations with regular 
singular points. The equation coefficients diverge at regular singular points in a way similar 
to the coefficients in an Euler equation. We will find solutions to these equations using the 
solutions to an Euler equation and power series centered precisely at the regular singular 
points of the equation. 
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3.1. Solutions Near Regular Points 
We study second order linear homogeneous differential equations with variable coefficients, 
y" + p(x) y' + 94(x)y = 0. 
We look for solutions on a domain where the equation coefficients p, q are analytic functions. 
Recall that a function is analytic on a given domain iff it can be written as a convergent 
power series expansions on that domain. In Appendix B we review a few ideas on analytic 
functions and power series expansion that we need in this section. A regular point of the 
equation is every point where the equation coefficients are analytic. We look for solutions 
that can be written as power series centered at a regular point. For simplicity we solve only 


homogeneous equations, but the power series method can be used with nonhomogeneous 
equations without introducing substantial modifications. 


3.1.1. Regular Points. We now look for solutions to second order linear homogeneous 
differential equations having variable coefficients. Recall we solved the constant coefficient 
case in Chapter 2. We have seen that the solutions to constant coefficient equations can 
be written in terms of elementary functions such as quotient of polynomials, trigonometric 
functions, exponentials, and logarithms. For example, the equation 

y +y=0 
has the fundamental solutions y,(2) = cos(a) and y(x) = sin(x). But the equation 
cy’ +y +ey=0 

cannot be solved in terms of elementary functions, that is in terms of quotients of poly- 
nomials, trigonometric functions, exponentials and logarithms. Except for equations with 
constant coefficient and equations with variable coefficient that can be transformed into 
constant coefficient by a change of variable, no other second order linear equation can be 
solved in terms of elementary functions. Still, we are interested in finding solutions to vari- 
able coefficient equations. Mainly because these equations appear in the description of so 
many physical systems. 

We have said that power series define more general functions than the elementary func- 


tions mentioned above. So we look for solutions using power series. In this section we center 
the power series at a regular point of the equation. 


Definition 3.1.1. A point x) € R is called a regular point of the equation 
y" + p(x) y’ + a(x) y =0, (3.1.1) 


iff p, q are analytic functions at x). Otherwise xo is called a singular point of the equation. 


Remark: Near a regular point x, the coefficients p and q in the differential equation above 
can be written in terms of power series centered at Zo, 


co 
p(x) = po + pi (w — 20) + po (a — ao)? +++» = D> pp (@— a)”, 
n=0 
co 
g(a) = 90 + 41 (@ — ao) + G2 (a — a)? +--+ = >> On (@— 20)”, 
n=0 


and these power series converge in a neighborhood of 29. 


Example 3.1.1. Find all the regular points of the equation 
ry’ +y' +a? y=0. 
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Solution: We write the equation in the form of Eq. (3.1.1), 
W 1 i: 
yo + es y +xry=0. 


In this case the coefficient functions are p(a#) = 1/x, and q(a) = x. The function q is analytic 
in R. The function p is analytic for all points in R — {0}. So the point 2) = 0 is a singular 
point of the equation. Every other point is a regular point of the equation. <J 


3.1.2. The Power Series Method. The differential equation in (3.1.1) is a particular 
case of the equations studied in § 2.1, and the existence result in Theorem 2.1.2 applies to 
Eq. (3.1.1). This Theorem was known to Lazarus Fuchs, who in 1866 added the following: If 
the coefficient functions p and q are analytic on a domain, so is the solution on that domain. 
Fuchs went ahead and studied the case where the coefficients p and q have singular points, 
which we study in § 3.2. The result for analytic coefficients is summarized below. 


Theorem 3.1.2. If the functions p, q are analytic on an open interval (a) — p,%o +p) CR, 
then the differential equation 


y" +p(x)y' + q(x) y =, 
has two independent solutions, y1, y2, which are analytic on the same interval. 


Remark: A complete proof of this theorem can be found in [2], Page 169. See also [10], 
§ 29. We present the first steps of the proof and we leave the convergence issues to the latter 
references. The proof we present is based on power series expansions for the coefficients p, 
q, and the solution y. This is not the proof given by Fuchs in 1866. 


Proof of Thorem 3.1.2: Since the coefficient functions p and q are analytic functions on 
(Xo — P,Xo + p), where p > 0, they can be written as power series centered at Xo, 


co co 
p(z) => pn(@—2a)", a(t) = >) an (@— 20)”. 
n=0 n=0 
We look for solutions that can also be written as power series expansions centered at 2o, 


y(a) = » Gn (@ — Xo)”. 
n=0 


We start computing the first derivatives of the function y, 
co co 
y(t) = >) nan (@—a)Y => y(e) => onan (e- mo), 
n=0 n=1 


where in the second expression we started the sum at n = 1, since the term with n = 0 
vanishes. Relabel the sum with m = n— 1, so when n = 1 we have that m = 0, and 
n=m-+1. Therefore, we get 


y (x) = oe (m+ leans) (@ = 20)”. 


m=0 


We finally rename the summation index back to n, 


co 


y (2) = So (n+ lang (@ — ao)”. (3.1.2) 


n=0 
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From now on we do these steps at once, and the notation n — 1 = m— n means 


=> Ny, (& — Lo) = =o + Magi (2 m5)”, 


n=0 
We continue computing on second derivative of function y, 
co 
y" (a) = So n(n — 1an (@ — a0) -), 
n=2 


and the transformation n — 2 =m — n gives us the expression 
co 
y (x) = DG: + 2)(n + 1)@(n42) (2 — to)”. 
n=0 
The idea now is to put all these power series back in the differential equation. We start 


with the term 
= = (Doan XL — Xo) s) (Do an( XL — Xo) “ 
= (Yo aran-e) (e— 0)" 


n=0 k=0 
where the second expression above comes from standard results in power series multiplica- 
tion. A similar calculation gives 


n=0 atl 
/ =>. (d k+1) a(k41)Pn—k) (x — 2)”. 


n=0 k=0 
Therefore, the differential equation y” + p(x) y’ + q(x) y = 0 has now the form 


>. [(n +2)(n+ 1)a(n+2) + S- [(k + 1)4(441)P(n—k) F andar] | (x — %)” = 0. 
n=0 k=0 


So we obtain a recurrence relation for the coefficients an, 


(n + 2)(n + 1)ain4a) + S- [(K + l)aceg1)P(n—k) + 4eI(n—K)] = 9, 


k=0 
for n = 0,1,2,---. Equivalently, 
1 n 
A(n+2) = — Cea Cea S- [(k + 1)a(e+1)P(n—k) + GkQ(n—k)- (3.1.3) 
k=0 
We have obtained an expression for a(,42) in terms of the previous coefficients a(,+41),*+* , @o 


and the coefficients of the function p and q. If we choose arbitrary values for the first two 
coefficients ag and aj, the the recurrence relation in (3.1.3) define the remaining coefficients 
a2,@3,::: in terms of ao and a,. The coefficients a, chosen in such a way guarantee that 
the function y defined in (3.1.2) satisfies the differential equation. 

In order to finish the proof of Theorem 3.1.2 we need to show that the power series 
for y defined by the recurrence relation actually converges on a nonempty domain, and 
furthermore that this domain is the same where p and gq are analytic. This part of the 
proof is too complicated for us. The interested reader can find the rest of the proof in [2], 
Page 169. See also [10], § 29. 
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It is important to understand the main ideas in the proof above, because we will follow 
these ideas to find power series solutions to differential equations. So we now summarize 
the main steps in the proof above: 


a) Write a power series expansion of the solution centered at a regular point 29, 
g 
co 
y(x“) = ys An (LZ — Xo)”. 
n=0 


(b) Introduce the power series expansion above into the differential equation and find a 
recurrence relation among the coefficients an. 


(c) Solve the recurrence relation in terms of free coefficients. 
(d) If possible, add up the resulting power series for the solutions y,, yo. 


We follow these steps in the examples below to find solutions to several differential 
equations. We start with a first order constant coefficient equation, and then we continue 
with a second order constant coefficient equation. The last two examples consider variable 
coefficient equations. 


Example 3.1.2. Find a power series solution y around the point 2) = 0 of the equation 
y +cy=0, ceR. 
Solution: We already know every solution to this equation. This is a first order, linear, 
differential equation, so using the method of integrating factor we find that the solution is 
y(t) = ae", a ER. 


We are now interested in obtaining such solution with the power series method. Although 
this is not a second order equation, the power series method still works in this example. 
Propose a solution of the form 


loc) Co 
y= S- anv” => y= S- nan oY, 
n=0 n=1 


We can start the sum in y’ at n = 0 or n = 1. We choose n = 1, since it is more convenient 
later on. Introduce the expressions above into the differential equation, 


co co 
s Nan vr 1 +e y An xv” = 0. 
n=1 n=0 


Relabel the first sum above so that the functions 7”~! and x” in the first and second sum 
have the same label. One way is the following, 


PaG: + Deg e + y, Ca, 2" =0 
n=0 n=0 


We can now write down both sums into one single sum, 


Co 


(n+ l)d(n41) + Can| 2” = 0. 
(n+1) 


n=0 
Since the function on the left-hand side must be zero for every x € R, we conclude that 
every coefficient that multiplies «” must vanish, that is, 


(n+ lamn4iy + Can = 0, n> 0. 
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The last equation is called a recurrence relation among the coefficients a,. The solution of 
this relation can be found by writing down the first few cases and then guessing the general 
expression for the solution, that is, 


n=0, a, = —CAy => ad, = —CQo, 
a 
n=1, 2a, = —-ca, => Ap = By > 
3 
n= 2, 3a3 = —Ca, => ds = — ay Gos 
A 
n= 3, 4a4 = —ca3 => a4 = qo 


which implies that the solution of the differential equation is given by 


y(x) = ao ews zc => y(r) =a S- Cen)" => y(r)=ae°*. 
n=0 : n=0 , 


dq 
Example 3.1.3. Find a power series solution y(x) around the point 2) = 0 of the equation 
y +y=0. 


Solution: We know that the solution can be found computing the roots of the characteristic 
polynomial r? + 1 = 0, which gives us the solutions 


y(x) = do cos(a) + a, sin(x). 


We now recover this solution using the power series, 
co co co 
y= S- a,c” > y= S- nant’), > yl = S- n(n —1)ay 2-?), 
n=0 n=1 n=2 


Introduce the expressions above into the differential equation, which involves only the func- 
tion and its second derivative, 


Co Co 
S- n(n —1)an 2”? + > An v2” = 0. 
n=2 n=0 


Relabel the first sum above, so that both sums have the same factor 7”. One way is, 


SY (n + 2)(n + 1)a~n42) 2” + S- ay, x" = 0. 
n=0 n=0 


Now we can write both sums using one single sum as follows, 


S- [(n + 2)(n + l)amr42) +@n] 27> =0 > (n+ 2)(n+1ainze) tan =0. n ZO. 


n=0 
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The last equation is the recurrence relation. The solution of this relation can again be found 
by writing down the first few cases, and we start with even values of n, that is, 


1 
n=0, (2)(1)ay = —ao => dp = — 5 Ao 
1 
n= 2, (4)(3)a4 = —ay => a4 = Fy Ao, 
1 
n=A, (6)(5)ag = —a4 > dg = — Fy Ao- 


One can check that the even coefficients a2, can be written as 


et is 
Qa2k = Qk)!” 


The coefficients a,, for the odd values of n can be found in the same way, that is, 


1 
n=1, (3)(2)a3 = —a, > da = — 3 Oa, 
a 
n=8, (5)(4)a5 = —as > a5 = Fy 
1 
n=5, (7)(6)a7 = —a5 => a7 = — 7 a. 


One can check that the odd coefficients a2,41 can be written as 


i" 


Qok41 = Qk+1)! ay 


Split the sum in the expression for y into even and odd sums. We have the expression for 
the even and odd coefficients. Therefore, the solution of the differential equation is given by 


co k co 
ae g2kti 
= Ao - 
ae ! = one ~ 1)! 


One can check that these are precisely the power series representations of the cosine and 
sine functions, respectively, 


y(x) = a cos(a) + a, sin(x). 
dq 


Example 3.1.4. Find the first four terms of the power series expansion around the point 
Xo = 1 of each fundamental solution to the differential equation 


y’ —zy' —y=0. 
Solution: This is a differential equation we cannot solve with the methods of previous 


sections. This is a second order, variable coefficients equation. We use the power series 
method, so we look for solutions of the form 


y= So n(x — 1)” => y= >> nan(a — 1)" f= 3 ne eae 1™ 
= n=2 
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We start working in the middle term in the differential equation. Since the power series is 
centered at 2) = 1, it is convenient to re-write this term as xy’ = [(a — 1) + 1] y’, that is, 


co 
zy = > Nanx(a —1)"—* 


n=1 


= > nay |(e—1)+1](e@—1)"7" 


n=1 
= S > nan (a — 1)? + S> nan (a — i’ (3.1.4) 
n=1 n=1 


As usual by now, the first sum on the right-hand side of Eq. (3.1.4) can start at n = 0, since 
we are only adding a zero term to the sum, that is, 


> Nay(x — 1)” = s NGy (a — 1)"; 
n=1 


n=0 


while it is convenient to relabel the second sum in Eq. (3.1.4) follows, 


> Nan (#— 1)" = So(n + 1)a(n41)(@ — 1)”; 
n=1 n=0 


so both sums in Eq. (3.1.4) have the same factors (a — 1)”. We obtain the expression 


oy = S- NOn (a — 1)" + So(n + eg sale =)" 
n=0 n=0 
= » [nan + (n+ L)@¢n41y] (2 - 1)”. (3.1.5) 
n=0 


In a similar way relabel the index in the expression for y’’, so we obtain 


CO 


of” = Sn +2(mt Laing) — I)”. (3.1.6) 


n=0 


If we use Eqs. (3.1.5)-(3.1.6) in the differential equation, together with the expression for y, 
the differential equation can be written as follows 


So (n+ 2)(n + l)agnga)(@ — 1)" — $7 [nan + (n+ Vagryty] (@ — 1)” — 5 an(a - 1)" =0. 
n=0 n=0 n=0 


We can now put all the terms above into a single sum, 


Co 


y [in + 2)(n + 1)amn42) — (2 + 1)ain4i1y — NGn an| (2 —1)” =0. 


n=0 


This expression provides the recurrence relation for the coefficients a, with n > 0, that is, 
(n+ 2)(n + 1)aingeay — (n+ 1)aing1) — (2 + 1)an = 0 
(n+ 1)[(m + 2)aqn42) — ang) — On] = 0, 
which can be rewritten as follows, 


(n ate 2)Q(n42) — @(n41) — an = 0. (3.1.7) 
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We can solve this recurrence relation for the first four coefficients 


ay Qo 

=0 Qa, — a, — a =0 = = += 

n Ag — A, — Ao Qo 2 2” 
ay Ao 

= 3a3 — a, — a, =0 => = =+— 

n 3 — Ag — Ay a3 2 6° 
ay Qo 

= 2 4 —_ —_ = 0 => =—_+ —, 
n G4 — G3 — Ag a4 A 6 


Therefore, the first terms in the power series expression for the solution y of the differential 
equation are given by 


_ : : Qo ‘ ay ae (2 =) 3 (2 ; =) 4, eat 
y=ata(a + (2+ SF) 1)*4 rae IY +(erz)e 1)* 4 
which can be rewritten as 
1 1 1 
= 14 1)? 4 1)° 4 tte] 
y= afl + 5(@— 1)? + 5(2-1)9 + =(2-1) 

| | 1 2 i 1 3 i 1 - 

! a,{(e 1) + 5(e- 1)? + 5(@-1)9 + Fe 144 
So the first four terms on each fundamental solution are given by 

1 1 il 
Y=i14 5 (# 1)? + g 1)? 4 g(t 1)", 


1 i, 1 
Yo = (x 1) + 5(e 1) t 5 (e 1)°+ 72 i. 


J 


Example 3.1.5. Find the first three terms of the power series expansion around the point 
Zo = 2 of each fundamental solution to the differential equation 


y —xy=0. 
Solution: We then look for solutions of the form 
y= S- An (a — 2)” 
n=0 


It is convenient to rewrite the function xy = [(a — 2) + 2] y, that is 


xy = > Anx(a — 2)” 
n=0 


= S/ an[(# — 2) + 2] (a — 2)" 


n=0 
= So an(a — 2)"*2 + S° 2an (a — 2)”. (3.1.8) 
n=0 n=0 


We now relabel the first sum on the right-hand side of Eq. (3.1.8) in the following way, 


a e—2)""" = wee (3.1.9) 
n=0 n=1 
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We do the same type of relabeling on the expression for y”, 


y= S- n(n — 1)an(a — 2)"~? 


Me 


(n+ 2)(n + 1)acn42)(x — 2)”. 


Then, the differential equation above can be written as follows 


co 


So (n+ 2)(n + Langa) ( (a —2 "= So toate — 2) Yin (n—1) (a — 2)" =0 
n=0 =I 
(2)(1)a. — 2a + 3 [(n +2)(n + lamse) — 2an — ann] (x — 2)" =0. 


n=1 


So the recurrence relation for the coefficients a, is given by 
Az — Gy = 0, (n+ 2)(n ou 1)a(n+2) — 2a — Q(n—1) = 0, n2i. 


We can solve this recurrence relation for the first four coefficients, 


n=0 Ay — A = 0 => lg = Ao, 

n=1 (3)(2)as — 2a, — a = 0 => a= ets, 
Ao ay 

HY (4) (3)a4 az — a, = 0 => a4 5 io 


Therefore, the first terms in the power series expression for the solution y of the differential 
equation are given by 


yaa toledo 2+ (eM) le2 + (eB )iena en 


which can be rewritten as 


1 1 
y= ao[1 + (x 2)? 4 g(* a4 5 (@ atten] 
1 1 
+a, (e 2)4 3 (e 2) 4 9% 2)* 4 | 
So the first three terms on each fundamental solution are given by 
1 
y. =1+ (e@— 2)? 4 5 be a 
= (2-2) + =(e— 2)? + (2-2)! 
Yo = (x 3 zx 1D 4H : 


J 


3.1.3. The Legendre Equation. The Legendre equation appears when one solves the 
Laplace equation in spherical coordinates. The Laplace equation describes several phenom- 
ena, such as the static electric potential near a charged body, or the gravitational potential 
of a planet or star. When the Laplace equation describes a situation having spherical sym- 
metry it makes sense to use spherical coordinates to solve the equation. It is in that case 
that the Legendre equation appears for a variable related to the polar angle in the spherical 
coordinate system. See Jackson’s classic book on electrodynamics [8], § 3.1, for a derivation 
of the Legendre equation from the Laplace equation. 
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Example 3.1.6. Find all solutions of the Legendre equation 
(1-2?) y” —2ry' +1(1+1)y=0, 
where | is any real constant, using power series centered at x» = 0. 
Solution: We start writing the equation in the form of Theorem 3.1.2, 
7 Qu ,, +2) 
i=)" =a)" 
It is clear that the coefficient functions 


a fies i(i+1) 
p(x) = (1 — 22)’ a( ) (1 — 2)’ 


= 0. 


are analytic on the interval |z| < 1, which is centered at 2 = 0. Theorem 3.1.2 says that 
there are two solutions linearly independent and analytic on that interval. So we write the 
solution as a power series centered at 2% = 0, 


co 
y(x) = » Anz”, 
n=0 
and we compute its derivative, 
co co co 
7 (2) = S- iar = a Ce So(n + 1)@(n41) 2”, 
n=0 n=1 n=0 


where the first equality is the plain derivative, in the second we start the sum at n = 1 since 
the first term in the sum is zero, and in the third equality we rename the summation index 
n — n—1, so when the old index starts at one, the new index starts at zero. The second 
derivative of y is treated in a similar way, 


y' (a) = ye n(n —1)a, 2°? = SS n(n —1)an 2"? = So(n + 2)(n + 1)ain4ey 2”. 
n=0 n=2 n=0 


Then we continue working as follows, 


y= So(n + 2)(n + Lanta) &”, 


li+l)y= 2 Wl+ lana”. 


n=0 
The Legendre equation says that the addition of the four equations above must be zero, 


S"((n + 2)(n + l)aing2) — (2 — 1)nan — 2nay + U(L+ Lan) x” = 0. 


n=0 


Therefore, every term in that sum must vanish, 


(n+ 2)(n + 1l)amn4z2) — (n — 1)nay — 2nay + 11+ 1)an = 0, n 


W 
i) 
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This is the recurrence relation for the coefficients a,. After a few manipulations the recur- 
rence relation becomes 


(l—n)(lt+n+1) 


— > 
Q(n+2) (n Ee 2)(n cs 1) an, Nhe 0 
By giving values to n we obtain, 
ii+1 1—1)(1+2 
a ( Tl Vice gee ae) vase 
Since aq is related to a2 and as is related to a3, we get, 
_ (f-2)04+3) _ (—2)l0 +1) +3) 
“ea _ 41 20; 
(l—3)(1+ 4) (i— 3)(0—1)(14+ 2)(1 + 4) 
a5 = ———— > = ay 


Oo ” = BI 


If one keeps solving the coefficients a, in terms of either ag or a ,, one gets the expression, 


rem 01 a a ea = 
f-1)(@+2 (—3)(0-—1)0+2)(+4 
gf gS i ODED i) 


Hence, the fundamental solutions are 


ii+1 [—-2)¢+1)0+3 
(1+1) 2, @= 20+ E43) a 
2! 4! 
I-1 - — 
(= 0+2) 5, C= 9E-YU4 0449 5 
3! 5! 
The ration test provides the interval where the seires above converge. For function y, we 
get, replacing n by 2n, 


ys(z) =1 


Yo(x) = x 


Q2n seen oe (1 — 2n)(1+ 2n 4+ 1) | 2 


2 
> —> 7 
an (Qn + 1)(2n + 2) el ee ares 


a2n 


A similar result holds for y,. So both series converge on the interval defined by |a| <1. < 


Remark: The functions y;, y2 are called Legendre functions. For a non-integer value of 
the constant / these functions cannot be written in terms of elementary functions. But 
when / is an integer, one of these series terminate and becomes a polynomial. The case 
! being a nonnegative integer is specially relevant in physics. For / even the function y, 
becomes a polynomial while y, remains an infinite series. For | odd the function y, becomes 
a polynomial while the y, remains an infinite series. For example, for | = 0,1, 2,3 we get, 


1=0, yi(x) = 1, 

l=1, Yy2(x) = 2, 

[= 2, yi(a) = 1-327, 
5 

1=8, ye(a) = 2-3 2° 


The Legendre polynomials are proportional to these polynomials. The proportionality fac- 
tor for each polynomial is chosen so that the Legendre polynomials have unit length in a 
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particular chosen inner product. We just say here that the first four polynomials are 


1=0, yi(a) = 1, Po =, Po(«) = 1, 
f=1, y.(r) = 2, Pr = 4, Pi(z) =a, 
1 1 
[= 2, yi(x) = 1-327, P= 5 Ms P,(x) = 5 (32° — 1), 
5 3 3 1, 4 
1=3, y(t) =x—- 5% : Py=—5 Ya Ps(x) = 5 (52 — 32). 


These polynomials, P,,, are called Legendre polynomials. The graph of the first four Le- 
gendre polynomials is given in Fig. 1. 


FIGURE 1. The graph of the first four Legendre polynomials. 


3.1.4. Exercises. 


3.1.1.- . 


3.1. SOLUTIONS NEAR REGULAR POINTS 


3.1.2.- . 
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3.2. Solutions Near Regular Singular Points 


We continue with our study of the solutions to the differential equation 


y" + p(x) y' +(x) y = 0. 
In § 3.1 we studied the case where the coefficient functions p and q were analytic functions. 
We saw that the solutions were also analytic and we used power series to find them. In 
§ ?? we studied the case where the coefficients p and q were singular at a point x). The 
singularity was of a very particular form, 


p(x) = Pe 


_ qo 
=a.) ' q(x) = ( 2? 


L — Lo) 
where Po, go are constants. The equation was called the Euler equidimensional equation. 
We found solutions near the singular point x). We found out that some solutions were 
analytic at 2 and some solutions were singular at xo. In this section we study equations 
with coefficients p and q being again singular at a point v). The singularity in this case is 
such that both functions below 


(w—2)p(z), — (w ~ %0)*q(a) 


are analytic in a neighborhood of 2). The Euler equation is the particular case where these 
functions above are constants. Now we say they admit power series expansions centered at 
Zo. So we study equations that are close to Euler equations when the variable x is close to 
the singular point xz). We will call the point 2» a regular singular point. That is, a singular 
point that is not so singular. We will find out that some solutions may be well defined at 
the regular singular point and some other solutions may be singular at that point. 


3.2.1. Regular Singular Points. In § 3.1 we studied second order equations 
y" + p(x) y' + 9(x)y =0. 


and we looked for solutions near regular points of the equation. A point 2 is a regular point 
of the equation iff the functions p and q are analytic in a neighborhood of x. In particular 
the definition means that these functions have power series centered at 2o, 


p(t)= > pn(e—a)", (2) = 9 an (@ — 2)”, 
n=0 n=0 


which converge in a neighborhood of 2. A point 2p is called a singular point of the equation 
if the coefficients p and q are not analytic on any set containing x). In this section we study 
a particular type of singular points. We study singular points that are not so singular. 


Definition 3.2.1. A point x € R is a regular singular point of the equation 
y" + p(x) y' + q(x) y = 0. 
iff both functions py, and q,, are analytic on a neighborhood containing to, where 


Buo(w) =(—at)p(x), — Gug(x) = (% — ao)” G(x). 


Remark: The singular point x) in an Euler equidimensional equation is regular singular. 
In fact, the functions p,, and gz, are not only analytic, they are actually constant. The 
proof is simple, take the Euler equidimensional equation 

Po y + do 


i 
+ 
q (a — 2) (a — 2 


pe 
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and compute the functions p,, and gz, for the point 2%o, 


Poo() = (@—2—)( >) = Por dealt) = (@ ~ 20)" oe) = we 


Lv — Xo)? 


Example 3.2.1. Show that the singular point of Euler equation below is regular singular, 
(x — 3)? y” + 2(a — 3) y' +4y =0. 


Solution: Divide the equation by (a — 3)?, so we get the equation in the standard form 
" 2 py 4 


Yea) Gage 


The functions p and q are given by 


2 4 
p(x) = («—3)’ q(x) = («@—3)?” 


The functions p3 and q3 for the point x) = 3 are constants, 


Bs(2) = (@-3)(—=5) =2, f(z) = @-3"(— +5) a 


Therefore they are analytic. This shows that x) = 3 is regular singular. <J 


Example 3.2.2. Find the regular-singular points of the Legendre equation 
(L— 2?) y” —2ey' +1l+1y=0, 
where | is a real constant. 


Solution: We start writing the Legendre equation in the standard form 
2x Wi +1) 
" ry 
4 Ga)" * ay? 


The functions p and q are given by 


= 0, 


2x Wi +1) 
As ae 1) By 
These functions are analytic except where the denominators vanish. 
‘ LZ =1, 
(1-2?)=(1-a)(l+2)=0 > 
t=. 


Let us start with the singular point 7 = 1. The functions p,, and q,, for this point are, 


Pag (@) = (@ — I)p(@) = (a 1)( (1— 7G + >) => Pr (t) = ae 
: Hf »f Ul +1) Lt 1(@—1) 
ee ae (a=ana+H) = feo) "ny 


These two functions are analytic in a neighborhood of 2) = 1. (Both p,, and g,, have no 
vertical asymptote at x) = 1.) Therefore, the point 2. = 1 is a regular singular point. We 
now need to do a similar calculation with the point x, = —1. The functions ,, and qz, for 
this point are, 


pale) =@ +I) =@+0(-g—oyagy) 7 Aa =-Goap 
ie) = + ihde)= 4G aa ee) F HEAR 
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These two functions are analytic in a neighborhood of 7, = —1. (Both py, and @;, have no 
vertical asymptote at 2, = —1.) Therefore, the point 7, = —1 is a regular singular point.< 


Example 3.2.3. Find the regular singular points of the differential equation 


(a + 2)?(@ — 1) y+ 3(@— 1) y+ 2y=0. 


Solution: We start writing the equation in the standard form 
MW 3 y’ { 2 y= 
(a+2)27 © (2+ 2)2(x— 1) 


The functions p and q are given by 


(= (x) ; 
£) = ——; x£) = ——.—_... 

e (@+22? 7 (« + 2)2(@ —1) 

The denominators of the functions above vanish at x» = —2 and x, = 1. These are singular 
points of the equation. Let us find out whether these singular points are regular singular or 
not. Let us start with x) = —2. The functions p,, and q,, for this point are, 


3 3 


Bos () = (0+ 2)p(a) = (2 +2)( 55) > Balt) = Ey 


0. 


2 2 e 2 

) Canes 5) > dae(®) = — Gy 

We see that q,, is analytic on a neighborhood of x) = —2, but p,, is not analytic on any 
neighborhood containing x) = —2, because the function p,, has a vertical asymptote at 
XZ = —2. So the point xz) = —2 is not a regular singular point. We need to do a similar 
calculation for the singular point x, = 1. The functions p,, and @,, for this point are, 

3 3(a — 1) 


Bro) = (e- Vol) =(@- (Soap) Pal) = Tye 


Ge.(x) = (u + 2)*q(x) = (a +2 


des (2) = (2 — 1)?a(2) = (@ -1)°( aie =i) da,(2) = a 


We see that both functions p,, and @;, are analytic on a neighborhood containing 7, = 1. 
(Both p,, and g;, have no vertical asymptote at x, = 1.) Therefore, the point x, = 1 isa 
regular singular point. <J 


Remark: It is fairly simple to find the regular singular points of an equation. Take the 
equation in out last example, written in standard form, 


uw 3 ry 2 
Yo 4229  @422e_N 


The functions p and q are given by 


y = 0. 


3 2 
p(x) = (e422 q(x) = (c@+2°(@—1) 
The singular points are given by the zeros in the denominators, that is 7» = —2 and x, = 1. 
The point 2 is not regular singular because function p diverges at x) = —2 faster than 
on The point 2, = 1 is regular singular because function p is regular at 7, = 1 and 


function q diverges at x, = 1 slower than 


ae 
(x — 1)?" 
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3.2.2. The Frobenius Method. We now assume that the differential equation 
y +p(x)y’ + q(x) y = 0, (3.2.1) 
has a regular singular point. We want to find solutions to this equation that are defined 
arbitrary close to that regular singular point. Recall that a point x is a regular singular 
point of the equation above iff the functions (a — 2) p and (x — 2) q are analytic at x. A 


function is analytic at a point iff it has a convergent power series expansion in a neighborhood 
of that point. In our case this means that near a regular singular point holds 


(a — 2) p(x) = > pn (@ — ao)” = po + pila — 20) + pala — a)? +--- 


n=0 


(a — 2)? q(x) = D7 dn (@ — 20)” = G0 + a1(@ — 20) + G2(a — 20)? ++-° 
n=0 


This means that near x, the function p diverges at most like (x — 2 )~! 


diverges at most like (x — x))~?, as it can be seen from the equations 
Po 

p(x) = ——~ 

(x) Caen 


and function q 


+ pi + po(@— 2%) +--: 


qo 1 
+ 
L—-Xy)2 (L#— 2X) 


+qa+-:: 


q(x) = 


Therefore, for po and go nonzero and « close to 2 we have the relations 


Po qo 


p(x) ~ aa q(x) ~ Caray LY Be; 


where the symbol a ~ b, with a, b € R means that |a — b| is close to zero. In other words, 
the for x close to a regular singular point 2, the coefficients of Eq. (3.2.1) are close to the 
coefficients of the Euler equidimensional equation 

(x — xo)” yf + o(# — 20) Ye + do Ye = 0, 
where pp and qo are the zero order terms in the power series expansions of (x — 2) p and 
(x—2 9)? q given above. One could expect that solutions y to Eq. (3.2.1) are close to solutions 
ye to this Euler equation. One way to put this relation in a more precise way is 


y(x) = ye(x) b> An(x@— ao)” =>  y(ax) = ye(x) (ao + ai(2 — 2) +:: ae 
n=0 


Recalling that at least one solution to the Euler equation has the form y.(x) = (a — 2)", 
where r is a root of the indicial polynomial 


r(r —1)+ por + go = D, 


we then expect that for x close to x» the solution to Eq. (3.2.1) be close to 


co 
y(2) = (2 — 20)" >) an(e — 2)”. 
n=0 
This expression for the solution is usually written in a more compact way as follows, 


co 
ule) = Sag a = 24). 
n=0 
This is the main idea of the Frobenius method to find solutions to equations with regular 
singular points. To look for solutions that are close to solutions to an appopriate Euler 
equation. We now state two theorems summarize a few formulas for solutions to differential 
equations with regular singular points. 
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Theorem 3.2.2 (Frobenius). Assume that the differential equation 
y" + p(x) y' + a(x) y =0, (3.29) 


has a regular singular point x) € R and denote by po, qo the zero order terms in 
co co 

(«— 29) pl) => pn (w= 2)", (@— 49)? 4a) = Dn (e— 2)". 
n=0 n=0 


Let r,, r- be the solutions of the indicial equation 
r(r—1)+por+q =0. 


(a) If (r+—r-) is not an integer, then the differential equation in (3.2.2) has two independent 
solutions y,., y- of the form 


co 
ie >. Qn (@— 2)", with ag =1, 


n=0 


y-(x) = |x — xo|"” >. bn (a@— 20)", with bo =1. 


n=0 


ye(e) = |e — 2 


(b) If (rs —r-) = N, a nonnegative integer, then the differential equation in (3.2.2) has two 
independent solutions y,, y- of the form 


co 
y+(“) = |x — 2o|"* > Qn (@— 2)", with ap =1, 
n=0 


y-(“) = |x — 2o|"" bn (@ — &o)" +cy.(x) Inja— aol, with bo = 1. 


n=0 
The constant c is nonzero if N =0. If N > 0, the constant c may or may not be zero. 
In both cases above the series converge in the interval defined by |x — x| < p and the 
differential equation is satisfied for 0 < |x — a | < p. 
Remarks: 


(a) The statements above are taken from Apostol’s second volume [2], Theorems 6.14, 6.15. 
For a sketch of the proof see Simmons [10]. A proof can be found in [5, 7]. 

(b) The existence of solutions and their behavior in a neighborhood of a singular point was 
first shown by Lazarus Fuchs in 1866. The construction of the solution using singular 
power series expansions was first shown by Ferdinand Frobenius in 1874. 


We now give a summary of the Frobenius method to find the solutions mentioned in 
Theorem 3.2.2 to a differential equation having a regular singular point. For simplicity we 
only show how to obtain the solution y,. 


(1) Look for a solution y of the form y(x) = > lena, 
n=0 


(2) Introduce this power series expansion into the differential equation and find the indicial 
equation for the exponent r. Find the larger solution of the indicial equation. 


(3) Find a recurrence relation for the coefficients ay. 


(4) Introduce the larger root r into the recurrence relation for the coefficients a,. Only 
then, solve this latter recurrence relation for the coefficients a,. 


(5) Using this procedure we will find the solution y, in Theorem 3.2.2. 
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We now show how to use these steps to find one solution of a differential equation near a 
regular singular point. We show the case where the roots of the indicial polynomial differ by 


an integer. We show that in this case we obtain only solution y,. The solution y- does not 
co 


have the form y(x) = S- Gn (a — &)"*"), Theorem 3.2.2 says that there is a logarithmic 


n=0 
term in the solution. We do not compute that solution here. 


Example 3.2.4. Find the solution y near the regular singular point x) = 0 of the equation 


ay” —a(2+3)y + (e#+3)y=0. 


Solution: We look for a solution of the form 
y(x) = S- An get) | 
n=0 


The first and second derivatives are given by 


co co 


Ye) So(n +rja,c°-)), y' (a) = Son tr)(ntr—1)an at), 
n=0 n=0 
In the case r = 0 we had the relation 7° 9 nay, 2) = T° na,2-). But in our 


case r # 0, so we do not have the freedom to change in this way the starting value of the 
summation index n. If we want to change the initial value for n, we have to re-label the 
summation index. We now introduce these expressions into the differential equation. It is 
convenient to do this step by step. We start with the term (a + 3)y, which has the form, 


(c+ 3)y =(x+3) » An ot") 


n=0 


s An crt td) + x Ban ot") 


n=0 n=0 


S- @(n—1) grt) 4 S- 3a, 0°), (3.2.3) 


n=1 n=0 


We continue with the term containing y’, 


—a(x2+3)y' = —(2? +32) ym + r)an oPtr-Y) 


n=0 
=— So(n + 1r)dn ort) — S- 3(n + r)an at") 
n=0 n=0 
=— DG: ei Dae 5 pint) — a 3(n + r)an ot), (3.2.4) 
n=1 n=0 


Then, we compute the term containing y’’ as follows, 
co 
ay” = 2 So(n +r)(n+r—1)an et") 
n=0 


= See Lan, gorrr), (3.2.5) 


n=0 
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As one can see from Egqs.(3.2.3)-(3.2.5), the guiding principle to rewrite each term is to 
have the power function x("*") labeled in the same way on every term. For example, in 
Egs.(3.2.3)-(3.2.5) we do not have a sum involving terms with factors 2("+"—) or factors 
a(+r+)) Then, the differential equation can be written as follows, 


So(n +r)(n+r—1)a, at” — So(n per Lae 4) attr) 
n=0 n=1 


- S- 3(n+r)an grt) 4 + Qtn—1) Pr SS Ban 0"t") = 0. 
n=0 n=1 n=0 


In the equation above we need to split the sums containing terms with n > 0 into the term 
n = 0 and a sum containing the terms with n > 1, that is, 


[r(r — 1) — 8r + 3]agx"4 


S- [(n brit — Ld, — (nr Dag — Sr), + ag iy + 3an| grt") = 0, 
n=1 


and this expression can be rewritten as follows, 


[r(r 1)—3r4 3] ax" 4 


DL [le+nmstr 1) —3(n+r)4 3] an (ntr—1 Vain] grt") — 9 


and then, 


YL [e+ rn tr—1)—3(n+r 1)| ae (n+r 2)a¢n—1)| gtr) — 9 


n=1 


hence, 


[r(r — 1) — 8r + 3) aon" + S- [in tr—1)(n+r—3)a,—(n+r 2)a(n—1)| glint") — 0, 
n=1 
The indicial equation and the recurrence relation are given by the equations 
r(r —1)-—38r+3=0, (3.2.6) 
(n+r—1)(n+r—3)an — (n+7r—2)a(n_1) = 0. (3.2.7) 


The way to solve these equations in (3.2.6)-(3.2.7) is the following: First, solve Eq. (3.2.6) for 
the exponent r, which in this case has two solutions r+; second, introduce the first solution 
r, into the recurrence relation in Eq. (3.2.7) and solve for the coefficients a,; the result is 
a solution y, of the original differential equation; then introduce the second solution r_ into 
Eq. (3.2.7) and solve again for the coefficients a,,; the new result is a second solution y_. Let 
us follow this procedure in the case of the equations above: 


r?—4r+3=0 rs = 5 [44 VIe— 1] => { 


=, 


 — — 


Introducing the value r, = 3 into Eq. (3.2.7) we obtain 
(n+ 2)na, — (n+ 1)an_-1 = 0. 


One can check that the solution y, obtained form this recurrence relation is given by 


es ul 1 
yo(@) = ag a*[14+ 5a+ Fa" + at +. ; 
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Notice that r, — rr. = 3—1 = 2, this is a nonpositive integer. Theorem 3.2.2 says that 
the solution y_ contains a logarithmic term. Therefore, the solution y_ is not of the form 


co 
:> dnw'"t™, as we have assumed in the calculations done in this example. But, what does 
n=0 
happen if we continue this calculation for r. = 1? What solution do we get? Let us find 
out. We introduce the value r_ = 1 into Eq. (3.2.7), then we get 

n(n — 2)an — (n — 1)an_1 = 0. 


One can also check that the solution y_ obtained form this recurrence relation is given by 


z 2 1 1 
§-(e) = ane]? 32 4 jt tae |, 
2 1 1 a 
3 2 3 ~ 2 
= i be | > F=—y. 
a2 0°| sora ae Ge 
So get a solution, but this solution is proportional to y,. To get a solution not proportional 
to y, we need to add the logarithmic term, as in Theorem 3.2.2. <J 


3.2.3. The Bessel Equation. We saw in § 3.1 that the Legendre equation appears 
when one solves the Laplace equation in spherical coordinates. If one uses cylindrical co- 
ordinates insted, one needs to solve the Bessel equation. Recall we mentioned that the 
Laplace equation describes several phenomena, such as the static electric potential near a 
charged body, or the gravitational potential of a planet or star. When the Laplace equation 
describes a situation having cylindrical symmetry it makes sense to use cylindrical coordi- 
nates to solve it. Then the Bessel equation appears for the radial variable in the cylindrical 
coordinate system. See Jackson’s classic book on electrodynamics [8], § 3.7, for a derivation 
of the Bessel equation from the Laplace equation. 

The equation is named after Friedrich Bessel, a German astronomer from the first half 
of the seventeen century, who was the first person to calculate the distance to a star other 
than our Sun. Bessel’s parallax of 1838 yielded a distance of 11 light years for the star 
61 Cygni. In 1844 he discovered that Sirius, the brightest star in the sky, has a traveling 
companion. Nowadays such system is called a binary star. This companion has the size 
of a planet and the mass of a star, so it has a very high density, many thousand times 
the density of water. This was the first dead start discovered. Bessel first obtained the 
equation that now bears his name when he was studing star motions. But the equation 
first appeared in Daniel Bernoulli’s studies of oscillations of a hanging chain. (Taken from 
Simmons’ book [10], § 34.) 


Example 3.2.5. Find all solutions y(a) = > anx"*", with ao 4 0, of the Bessel equation 
n=0 


ay! +ay + (a? —a*)y=0, x>O0, 
where a is any real nonnegative constant, using the Frobenius method centered at x = 0. 


Solution: Let us double check that 2) = 0 is a regular singular point of the equation. We 
start writing the equation in the standard form, 


2 2 
Wa ee =u) 
Ho x 


y =0, 


so we get the functions p(x) = 1/x and q(x) = (a? — a?)/x?. It is clear that x = 0 isa 
singular point of the equation. Since the functions 


B(x) =ap(z)=1, — G(x) = 2"q(x) = (a* — a”) 
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are analytic, we conclude that x) = 0 is a regular singular point. So it makes sense to look 
for solutions of the form 


y(a) = S- iper x>0. 
n=0 


We now compute the different terms needed to write the differential equation. We need, 


xy(x) = oy Ane?) =  y(x) = S- Gin—2yu'"t), 
n=0 n=2 


where we did the relabeling n + 2 =m — n. The term with the first derivative is given by 
xy (x) = So(n + r)anxt, 
n=0 
The term with the second derivative has the form 


gy" (2) = So(n t+r)(n+r—1)ana*), 
n=0 


Therefore, the differential equation takes the form 


So(n tr)(ntr—1)anat” + So(n + r)ayxt”) 
n=0 n=0 


+ Yaga — rat aal) = 
n=2 n=0 


Group together the sums that start at n = 0, 


S [mt rn tr—1)t (nt) —a2]an 2) + Sagan, 
n=0 na2 


and cancel a few terms in the first sum, 


ss [(n Te ry = a” |an ttn) 4 S- Oe oe = 0. 
n=0 n=2 


Split the sum that starts at n = 0 into its first two terms plus the rest, 


(r? on a”) ao ar of [(r +4 1)? 5 a?| ay gt FY) 


+ S [(n +r)? —a*]an ttn) S- A(n—2) ar) — Q, 
n=2 n=2 


The reason for this splitting is that now we can write the two sums as one, 


(r? — a7 )ay 2” + [(r +1)? — a7 Ja, gt th) 4 eS [(n +1)? — a7] an + an—2) } gtr) — 9, 
n=2 


We then conclude that each term must vanish, 
(7? —a7)ao =0, [(r+1)?-a7]a,=0, [(n+r)?—a7]an+an—2) =0, n> 2. (3.2.8) 


This is the recurrence relation for the Bessel equation. It is here where we use that we look 
for solutions with a, 4 0. In this example we do not look for solutions with a, 4 0. Maybe 
it is a good exercise for the reader to find such solutions. But in this example we look for 
solutions with ap 4 0. This condition and the first equation above imply that 


r?—a? =0 


r Qa, 
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and recall that a is a nonnegative but otherwise arbitrary real number. The choice r = r, 
will lead to a solution y,, and the choice r = r_ will lead to a solution y_g. These solutions 
may or may not be linearly independent. This depends on the value of a, since r, —r_ = 2a. 
One must be careful to study all possible cases. 


Remark: Let us start with a very particular case. Suppose that both equations below hold, 
(r? — a?) =0, [(r + 1)? — a?] =0. 


This equations are the result of both a, 4 0 and a, 4 0. These equations imply 


1 

Pa(rtl? > Wwt+i=0 > r=—s. 
But recall that r = ta, and a = 0, hence the case a # 0 and a, # 0 happens only when 
a = 1/2 and we choose r_ = —a = —1/2. We leave computation of the solution y_;/2 as an 


exercise for the reader. But the answer is 


cos(2) “ sin(a) 


y_1/2(2) = ao Va ay Vi : 


From now on we assume that a #4 1/2. This condition on a, the equation r? — a? = 0, and 
the remark above imply that 

(r +1)? -—a? £0. 
So the second equation in the recurrence relation in (3.2.8) implies that a, = 0. Summariz- 
ing, the first two equations in the recurrence relation in (3.2.8) are satisfied because 


r4- =a, a, = 0. 


We only need to find the coefficients a,, for n > 2 such that the third equation in the 
recurrence relation in (3.2.8) is satisfied. But we need to consider two cases, r =r, = a and 
r= —a. 
We start with the case r= r, = a, and we get 
(n? + 2na) dn +Am-2)=0 +S n(n+ 2a) ap = —A(n_2). 
Since n > 2 and a > 0, the factor (n + 2a) never vanishes and we get 
eee, 
n(n + 2a) 
This equation and a, = 0 imply that all coefficients a2,41 = 0 for k > 0, the odd coefficients 
vanish. On the other hand, the even coefficent are nonzero. The coefficient ag is 
do Ao 


= —- = —___ 
oe 80 + Dan) 02 Pt ay 


the coefficient a4 is 
ag ag = 5 Ao 
a4= = -_ ; 
4 4(4-+ 20) 22(2)(2 +a) 4°" 94(2)(1 + a)(2 + a) 


the coefficient ag is 


a4 a4 a Ao 
— — aq = . 
“6 6(6+2a) 22(3)(3 +a) 6 263) (1 + a)(2 + a)(3 +a) 
Now it is not so hard to show that the general term a2,, for k = 0,1,2,--- has the form 
(—1)*a, 


O2k = 92k (KI) + a)(2+a)---(k+a) 
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We then get the solution yo 


feerll iF a> 0. (3.2.9) 


all) = aap 

Yo(X) = Ao x I » 2k (KI) +a)(2+a)---(k +a) 
The ratio test shows that this power series converges for all x > 0. When a, = 1 the 
corresponding solution is usually called in the literature as Ja, 


Ja(x) = x% [1 ; > 0. 
ae * End ra@ray era) ° 
We now look for solutions to the Bessel equation coming from the choice r = r_ = —a, 


with a, = 0, and a 41/2. The third equation in the recurrence relation in (3.2.8) implies 
(n? — 2na)an + Qn—2)=0 => n(n—2a)ay = —a(n_2). 
If 2a = N, a nonnegative integer, the second equation above implies that the recurrence 
relation cannot be solved for a, with n > N. This case will be studied later on. Now assume 
that 2a is not a nonnegative integer. In this case the factor (n — 2a) never vanishes and 
G(n—2) 
ay, = ——-.. 
n(n — 2a) 
This equation and a, = 0 imply that all coefficients a2,41 = 0 for k > 0, the odd coefficients 
vanish. On the other hand, the even coefficent are nonzero. The coefficient a is 
Qo Qo 


08 92 — Ba) = ORL — a)’ 


the coefficient a4 is 
ag ag do 
— = > = 

4 4(4—2a) —-22(2)(2—a) ie 


the coefficient ag is 


ag = = = ua > a= a 
o  6(6—2a)—«22(3)(3 — a) ° ~~ 96(3I)(1 — a)(2— a)(3 — a)" 
Now it is not so hard to show that the general term a2x, for k = 0,1,2,--- has the form 
(—1)* ao 


“ok © PR (RL — a)(2— a) (ka) 
We then get the solution y_q 


a \ = (—1)* 
y-a(e) = a2 [14 Do mE ae—ay Rea a>0. (3.2.10) 


The ratio test shows that this power series converges for all x > 0. When a, = 1 the 
corresponding solution is usually called in the literature as J_q, 


- fore) (—1)* 
Jo(t) =a" [1+ 2. BEET = a)@— a) = =) 


a> 0. 


The function y_g was obtained assuming that 2a is not a nonnegative integer. From the 
calculations above it is clear that we need this condition on @ so we can compute a, in 
terms of a(n—2). Notice that rz = +a, hence (r, — r-) = 2a. So the condition on a is the 
condition (r, — r-) not a nonnegative integer, which appears in Theorem 3.2.2. 

However, there is something special about the Bessel equation. That the constant 2a 
is not a nonnegative integer means that a is neither an integer nor an integer plus one- 
half. But the formula for y_ is well defined even when a is an integer plus one-half, say 
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k + 1/2, for k integer. Introducing this y_(,41/2) into the Bessel equation one can check 
that y_(x41/2) is a solution to the Bessel equation. 

Summarizing, the solutions of the Bessel equation function yq is defined for every non- 
negative real number a, and y_,q is defined for every nonnegative real number a except for 
nonnegative integers. For a given a such that both yg and y_q are defined, these func- 
tions are linearly independent. That these functions cannot be proportional to each other 
is simple to see, since for a > 0 the function yg is regular at the origin x = 0, while y_, 
diverges. 

The last case we need to study is how to find the solution y_. when a is a nonnegative 
integer. We see that the expression in (3.2.10) is not defined when a is a nonnegative 
integer. And we just saw that this condition on a is a particular case of the condition in 
Theorem 3.2.2 that (r, —r_) is not a nonnegative integer. Theorem 3.2.2 gives us what is 
the expression for a second solution, y_, linearly independent of y,, in the case that a is a 
nonnegative integer. This expression is 

co 
Y—a(£) = Ya(x) In(a) + a7 ° > Cyt. 
n=0 
If we put this expression into the Bessel equation, one can find a recurrence relation for the 
coefficients c,. This is a long calculation, and the final result is 


Veal?) = Yo(z) In(x) 


i= es (Re Ptntas) L\2n 
(-1) n! (n+ a)! a) , 


with hy = 0, hy =1+4$+4---++4 for n 21, and a a nonnegative integer. <J 


3.2.4. Exercises. 


3.2.1.- . 
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Notes on Chapter 3 


Sometimes solutions to a differential equation cannot be written in terms of previously 
known functions. When that happens the we say that the solutions to the differential 
equation define a new type of functions. How can we work with, or let alone write down, a 
new function, a function that cannot be written in terms of the functions we already know? 
It is the differential equation what defines the function. So the function properties must be 
obtained from the differential equation itself. A way to compute the function values must 
come from the differential equation as well. The few paragraphs that follow try to give sense 
that this procedure is not as artificial as it may sound. 


Differential Equations to Define Functions. We have seen in § 3.2 that the solutions 
of the Bessel equation for a 4 1/2 cannot be written in terms of simple functions, such as 
quotients of polynomials, trigonometric functions, logarithms and exponentials. We used 
power series including negative powers to write solutions to this equation. To study prop- 
erties of these solutions one needs to use either the power series expansions or the equation 
itself. This type of study on the solutions of the Bessel equation is too complicated for these 
notes, but the interested reader can see [14]. 

We want to give an idea how this type of study can be carried out. We choose a 
differential equation that is simpler to study than the Bessel equation. We study two 
solutions, C and S, of this particular differential equation and we will show, using only the 
differential equation, that these solutions have all the properties that the cosine and sine 
functions have. So we will conclude that these solutions are in fact C(x) = cos(a#) and 
S(x) = sin(x). This example is taken from Hassani’s textbook [?], example 13.6.1, page 
368. 


Example 3.2.6. Let the function C' be the unique solution of the initial value problem 
Cc” +C=0, C(0)=1, C’(0) =0, 

and let the function S be the unique solution of the initial value problem 
S”+5=0, S(0)=0, $’(0)=1. 

Use the differential equation to study these functions. 


Solution: 
(a) We start showing that these solutions C and S' are linearly independent. We only need 
to compute their Wronskian at « = 0. 
W(0) = C(0) $’(0) — C’(0) S(0) =1 40. 
Therefore the functions C and S are linearly independent. 


(b) We now show that the function S is odd and the function C' is even. The function 
C(x) = C(—2) satisfies the initial value problem 


C"+C=C"4+C=0, C(0)=C(0)=1, C'(0)=-C'(0) =0. 


This is the same initial value problem satisfied by the function C. The uniqueness of 
solutions to these initial value problem implies that C(—x) = C(x) for all a € R, hence the 
function C' is even. The function S(x) = S(—2) satisfies the initial value problem 


S"+$=S8"4+5=0, $(0)=S(0)=0, S$'(0) =—-S'(0) =-1. 


This is the same initial value problem satisfied by the function —S. The uniqueness of 
solutions to these initial value problem implies that S(—x) = —S(x) for all  € R, hence 
the function S' is odd. 
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(c) Next we find a differential relation between the functions C and S. Notice that the 
function —C’ is the unique solution of the initial value problem 

(-C")"+(-C")=0, = -C"(0) = 0, (—C")'(0) = C(0) = 1. 


This is precisely the same initial value problem satisfied by the function S. The uniqueness 
of solutions to these initial value problems implies that —C = S, that is for all x € R holds 


C'(x) = —S(z). 
Take one more derivative in this relation and use the differential equation for C, 
S(@)=-C"(e)=Ca) => S'@)=CG@). 


(d) Let us now recall that Abel’s Theorem says that the Wronskian of two solutions to a 
second order differential equation y” + p(x) y' + q(x) y = 0 satisfies the differential equation 
W’ + p(x) W = 0. In our case the function p = 0, so the Wronskian is a constant function. 
If we compute the Wronskian of the functions C and S and we use the differential relations 
found in (c) we get 


W (x) = C(x) S’(x) — C'(x) S(x) = C?(x) + $?(x). 


This Wronskian must be a constant function, but at 2 = 0 takes the value W(0) = C?(0) + 
$?(0) = 1. We therefore conclude that for all x € R holds 


C? (a) + S7(x) =1. 


(e) We end computing power series expansions of these functions C' and S, so we have a 
way to compute their values. We start with function C. The initial conditions say 


C(0)=1, C'(0)=0. 


The differential equation at « = 0 and the first initial condition say that C’’(0) = —C(0) = 
—1. The derivative of the differential equation at x = 0 and the second initial condition say 
that C’”(0) = —C’(0) = 0. If we keep taking derivatives of the differential equation we get 


ce”(0)=-1, C”(0)=0, CM(0) =1, 


and in general, 


0 if n is odd, 
CoM (0) = a 
(-1)” ifn =2k, where k = 0,1,2,---. 
So we obtain the Taylor series expansion 
@ = -y' & 
C(x) = —1 ; 
=o 
which is the power series expansion of the cosine function. A similar calculation yields 
o0 , wht 
— = | c ————— 
a »| Y QeF Dl 


which is the power series expansion of the sine function. Notice that we have obtained these 
expansions using only the differential equation and its derivatives at x = 0 together with 
the initial conditions. The ratio test shows that these power series converge for all x € R. 
These power series expansions also say that the function S is odd and C is even. <J 
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Review of Natural Logarithms and Exponentials. The discovery, or invention, of a 
new type of functions happened many times before the time of differential equations. Look- 
ing at the history of mathematics we see that people first defined polynomials as additions 
and multiplications on the independent variable x. After that came quotient of polynomials. 
Then people defined trigonometric functions as ratios of geometric objects. For example the 
sine and cosine functions were originally defined as ratios of the sides of right triangles. 
These were all the functions known before calculus, before the seventeen century. Calculus 
brought the natural logarithm and its inverse, the exponential function together with the 
number e. 

What is used to define the natural logarithm is not a differential equation but integra- 
tion. People knew that the antiderivative of a power function f(a) = 2” is another power 
function F(a) = «(+ /(n +1), except for n = —1, where this rule fails. The antiderivative 
of the function f(#) = 1/z is neither a power function nor a trigonometric function, so at 
that time it was a new function. People gave a name to this new function, In, and defined 
it as whatever comes from the integration of the function f(x) = 1/z, that is, 


in(x) = [ ae x > 0. 
1 8 


All the properties of this new function must come from that definition. It is clear that this 
function is increasing, that In(1) = 0, and that the function take values in (—oo, 00). But 
this function has a more profound property, In(ab) = In(a) + In(b). To see this relation first 


compute 
ab a ab 
In(ab) = “-/ ae ge 
1 


1 8 8 a 8 


then change the variable in the second term, § = s/a, so dS = ds/a, hence ds/s = ds/8, and 
ag b 4~ 
In(ab) = / Oren Se ead), 
ie 2-9, 1 48 


The Euler number e is defined as the solution of the equation In(e) = 1. The inverse of the 
natural logarithm, In~', is defined in the usual way, 


In‘(_y)=2 © In(x)=y, x€(0,0), y€ (—~,00). 


Since the natural logarithm satisfies that In(a,27.) = In(#1) + In(z2), the inverse function 
satisfies the related identity n7'(y; + y2) = In~'(y,) In7'(y2). To see this identity compute 


In“"(y, + y2) = In7* (In(a) + In(22)) = In‘ (In(a122)) = 2,22 = In7~*(y,) In7* (yp). 


This identity and the fact that In~'(1) = e imply that for any positive integer n holds 


: n times P 
n times n times 


SS, OO i 
In“*(n) =n? (14+..-4+ 1=n_1(1)--- In? (1)=€--- =e”. 


This relation says that In~* is the exponential function when restricted to positive integers. 
This suggests a way to generalize the exponential function from positive integers to real 
numbers, eY = In~'(y), for y real. Hence the name exponential for the inverse of the natural 
logarithm. And this is how calculus brought us the logarithm and the exponential functions. 
Finally notice that by the definition of the natural logarithm, its derivative is In’(x) = 
1/x. But there is a formula relating the derivative of a function f and its inverse f~}, 
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Using this formula for the natural logarithm we see that 


1 
In7)'(y) = ————— = In“1(y)). 
In other words, the inverse of the natural logarithm, call it now g(y) = In‘ (y) = e¥, must 
be a solution to the differential equation 


g'(y) = 9(y). 
And this is how logarithms and exponentials can be added to the set of known functions. 
Of course, now that we know about differential equations, we can always start with the 
differential equation above and obtain all the properties of the exponential function using 
the differential equation. This might be a nice exercise for the interested reader. 


CHAPTER 4 


The Laplace Transform Method 


The Laplace Transform is a transformation, meaning that it changes a function into a new 
function. Actually, it is a linear transformation, because it converts a linear combination of 
functions into a linear combination of the transformed functions. Even more interesting, the 
Laplace Transform converts derivatives into multiplications. These two properties make the 
Laplace Transform very useful to solve linear differential equations with constant coefficients. 
The Laplace Transform converts such differential equation for an unknown function into an 
algebraic equation for the transformed function. Usually it is easy to solve the algebraic 
equation for the transformed function. Then one converts the transformed function back 
into the original function. This function is the solution of the differential equation. 

Solving a differential equation using a Laplace Transform is radically different from all 
the methods we have used so far. This method, as we will use it here, is relatively new. The 
Laplace Transform we define here was first used in 1910, but its use grew rapidly after 1920, 
specially to solve differential equations. Transformations like the Laplace Transform were 
known much earlier. Pierre Simon de Laplace used a similar transformation in his studies of 
probability theory, published in 1812, but analogous transformations were used even earlier 
by Euler around 1737. 
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4.1. Introduction to the Laplace Transform 


The Laplace transform is a transformation—it changes a function into another function. 
This transformation is an integral transformation—the original function is multiplied by an 
exponential and integrated on an appropriate region. Such an integral transformation is 
the answer to very interesting questions: Is it possible to transform a differential equation 
into an algebraic equation? Is it possible to transform a derivative of a function into a 
multiplication? The answer to both questions is yes, for example with a Laplace transform. 
This is how it works. You start with a derivative of a function, y’(t), then you multiply 
it by any function, we choose an exponential e~**, and then you integrate on t, so we get 


¥(@)> / em y!(t) dt, 


which is a transformation, an integral transformation. And now, because we have an inte- 
gration above, we can integrate by parts—this is the big idea, 


y(t) > ie y (t) dt =e y(t) +5 fe**y(t dt. 


So we have transformed the derivative we started with into a multiplication by this constant 
s from the exponential. The idea in this calculation actually works to solve differential 
equations and motivates us to define the integral transformation y(t) + Y(s) as follows, 


Oise / ent y(t) dt. 


The Laplace transform is a transformation similar to the one above, where we choose some 
appropriate integration limits—which are very convenient to solve initial value problems. 

We dedicate this section to introduce the precise definition of the Laplace transform 
and how is used to solve differential equations. In the following sections we will see that 
this method can be used to solve linear constant coefficients differential equation with very 
general sources, including Dirac’s delta generalized functions. 


4.1.1. Oveview of the Method. The Laplace transform changes a function into 
another function. For example, we will show later on that the Laplace transform changes 
a 
We will follow the notation used in the literature and we use t for the variable of the 
original function f, while we use s for the variable of the transformed function F’. Using 
this notation, the Laplace transform changes 


f(x) =sin(az) into F(x) = 


a 
s*+a?- 

We will show that the Laplace transform is a linear transformation and it transforms deriva- 
tives into multiplication. Because of these properties we will use the Laplace transform to 
solve linear differential equations. 

We Laplace transform the original differential equation. Because the the properties 
above, the result will be an algebraic equation for the transformed function. Algebraic 
equations are simple to solve, so we solve the algebraic equation. Then we Laplace transform 
back the solution. We summarize these steps as follows, 


f(t) =sin(at) into F(s)= 


differential sees Solve the Transform back 
uterentia (1) gebralc . algebraic (8), to obtain y. 
eq. for Ly]. 


eq. for y. 
eq. for Ly]. (Use the table.) 
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4.1.2. The Laplace Transform. The Laplace transform is a transformation, meaning 
that it converts a function into a new function. We have seen transformations earlier in these 
notes. In Chapter 2 we used the transformation 


Lly(t)] = y(t) + ar y(t) + ao y(t), 


so that a second order linear differential equation with source f could be written as L[y] = f. 
There are simpler transformations, for example the differentiation operation itself, 


Df@®] = f'O.- 


Not all transformations involve differentiation. There are integral transformations, for ex- 
ample integration itself, 


eo) = [Fiat 
ie) 
Of particular importance in many applications are integral transformations of the form 
b 
rio =f KU.) fan, 
where K is a fixed function of two variables, called the kernel of the transformation, and a, 


b are real numbers or +oo. The Laplace transform is a transfomation of this type, where 
the kernel is K(s,t) = e~*¢, the constant a = 0, and b= co. 


Definition 4.1.1. The Laplace transform of a function f defined on Dy = (0,00) is 


F(s) =a e * f(t) dt, (4.1.1) 
0 
defined for alls € Dr CR where the integral converges. 


In these note we use an alternative notation for the Laplace transform that emphasizes 
that the Laplace transform is a transformation: L[f] = F’, that is 


fal j= fen ) dt. 


So, the Laplace transform will be denoted as either £[f] or F', depending whether we want 
to emphasize the transformation itself or the result of the transformation. We will also use 
the notation Lif (t)], or L[f](s), or L[f(t)](s), whenever the independent variables ¢ and s 
are relevant in any particular context. 

The Laplace transform is an improper integral—an integral on an unbounded domain. 
Improper integrals are defined as a limit of definite integrals, 


co N 
: g(t) dt = lim g(t) dt. 
to 


Noo t 
(0) 


An improper integral converges iff the limit exists, otherwise the integral diverges. 
Now we are ready to compute our first Laplace transform. 


Example 4.1.1. Compute the Laplace transform of the function f(t) = 1, that is, [1]. 


Solution: Following the definition, 


foe) N 
L{1] = ; e dt= lim ed. 
0 


Noo 0 
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The definite integral above is simple to compute, but it depends on the values of s. For 
5 = 0 we get 


So, the improper integral diverges for s = 0. For s £ 0 we get 


N 


: —at * 1 —st “ * 1 —sN 
lim e *dt= lim —-e = lim —-(e —1). 
Noo Jo N-+co 8 0 N40 8 
For s < 0 we have s = —|s|, hence 
1 1 
lim —=(e** —1)= lim (elsIN — 1) = on. 
No § N-o  § 


So, the improper integral diverges for s < 0. In the case that s > 0 we get 


Example 4.1.2. Compute L[e%’], where a € R. 


Solution: We start with the definition of the Laplace transform, 


Lle™)] = e **(e%*) dt = e-e-ab ae. 
0 (0) 
In the case s = a we get 
fie) = | ldt =~, 
0 


so the improper integral diverges. In the case s £ a we get 


N 
Lie" | = wim eal" i, s#a, 
0 
= N 
= lim ( ) e (s-at 
N-ool(s — a) 0 


= lin (=) (e-a)y — ay]. 


N-+00 (s _ a) 


Now we have to remaining cases. The first case is: 


s—-a<0 => -(s—a)=|s—al/>0 > lim e@-9% =~, 
N-0oo 
so the integral diverges for s < a. The other case is: 
s-a>0 => (s—a)=—-|s—a|<0 3 lim e @-9* =9, 
N-oco 


so the integral converges only for s > a and the Laplace transform is given by 


| a s>a. 
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Example 4.1.3. Compute L{te*], where a € R. 


Solution: In this case the calculation is more complicated than above, since we need to 
integrate by parts. We start with the definition of the Laplace transform, 


foe) N 
cite | = | é te” di— lim i ie OO ge. 
0 N-0oo 0 


This improper integral diverges for s = a, so £[te®] is not defined for s = a. From now on 


we consider only the case s £ a. In this case we can integrate by parts, 


1 N 1 ° 
Lite“) = lim [- te (eat) / eon at, 
N-ool (s—a) o s-a J, 
that is, 
N 1 N 
Lite] = lim |- te (s-a)e|" _ __“___-(s--a)t |. (4.1.2) 
N-ool (s—a) 0 (s—a)? 0 
In the case that s < a the first term above diverges, 
1 
lim — Ne~&S-O*% = lim — Nel&-% = o0, 
N- oo (s = a) Noo (s — a) 


therefore L{te“’] is not defined for s < a. In the case s > a the first term on the right hand 
side in (4.1.2) vanishes, since 
1 1 
lim ———Ne@-9* =0, te @-9#| = 0. 
N>co (s—a) (s —a) t=0 
Regarding the other term, and recalling that s > a, 


1 1 


1 
li og (s-a)N _ 0 _*—(s—a)t a Se _ 
Ss (s—a)?~ ’ (s—a)?- = (s—a)? 
Therefore, we conclude that 
1 
ie") = —— > 0k 
Hels Grae 87 P 
Example 4.1.4. Compute L[sin(at)], where a € R. 
Solution: In this case we need to compute 
oe) N 
L{sin(at)] = | e*' sin(at) dt = lim e~*' sin(at) dt 
0 N-oo 0 


The definite integral above can be computed integrating by parts twice, 
| N 


i 1 
7 e*' sin(at) dt = ——[e~* sin(at)] 
0 (0) 


a mx ee re 
aa ler" cos(at)] | _ al e % sin(at) dt, 
S 8 0 }~©6s? Jy 


which implies that 
2 


N 
(1 +4 =) | e~*' sin(at) dt = -* [e~*' sin(at)| 


then we get 


[ e * sin(at) dt = aa [-= [e-** sin(at)] 


and finally we get 


N a _ st N 
ee [e ** cos(at) | . 


(0) 


, —stu: 8 1 —s : a —s 
7 e * sin(at) dt = wre [-- [e~ *% sin(aN) — 0] — ale N cos(aN) — i]. 
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One can check that the limit N — co on the right hand side above does not exist for s < 


so £[sin(at)] does not exist for s < 0. In the case s > 0 it is not difficult to see that 


[ sintat) at = ( = )[-@ 0) 5 (0 1) 


s*t+a?/ ls 


so we obtain the final result 


L{sin(at)] = ear s>0. 
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0, 


J 


In Table 1 we present a short list of Laplace transforms. They can be computed in the 


same way we computed the the Laplace transforms in the examples above. 


f(t) F(s) = L[f(@)] Dr 
f(t)=1 F(s) =~ a 
f(t) =e Fs) => T 
fo=e" F(s) = 5 s>0 
f(t) = sin(at) F(s)=5 > s>0 

f(t) = cos(at) F(s)=5 - s>0 

f(t) =sinh(at) F(s)=5 * = s > al 
f(t) = cosh(at) F(s) = ee s > |al 
fiat es Fs) = —— ay s>a 
f(t) = e@ sin(0t) Pia) = = s>a 

f(t) = e% cos(bt) F(s) = wate s>a 

f(t) = e sinh(dt) F(s)= GOLF s—a> |b 
f(t) = e* cosh(bt) F(s) = ee s—a> |b| 


TABLE 1. List of a few Laplace transforms. 
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4.1.3. Main Properties. Since we are more or less confident on how to compute 
a Laplace transform, we can start asking deeper questions. For example, what type of 
functions have a Laplace transform? It turns out that a large class of functions, those that 
are piecewise continuous on [0,0o) and bounded by an exponential. This last property is 
particularly important and we give it a name. 


Definition 4.1.2. A function f defined on [0,00) is of exponential order s), where so is 
any real number, iff there exist positive constants k, T such that 


lf()| < ke forall t>T. (4.1.3) 


Remarks: 


(a) When the precise value of the constant s>) is not important we will say that f is of 
exponential order. 
2 
(b) An example of a function that is not of exponential order is f(t) = e* . 
This definition helps to describe a set of functions having Laplace transform. Piecewise 
continuous functions on [0,00) of exponential order have Laplace transforms. 


Theorem 4.1.3 (Convergence of LT). If a function f defined on [0,00) is piecewise con- 
tinuous and of exponential order 8), then the Lif] exists for all s > 8. and there exists a 
positive constant k such that 


Proof of Theorem 4.1.3: From the definition of the Laplace transform we know that 


Lif] = lim e* f(t) dt. 


N-oco 0 


The definite integral on the interval [0,.N] exists for every N > 0 since f is piecewise 
continuous on that interval, no matter how large N is. We only need to check whether the 
integral converges as N — oo. This is the case for functions of exponential order, because 


N N N N 
/ e-** F(z) dt| < | e**| f(t)| dt 2) e thes dt = ‘| e~ (8—S0)t dt. 
10) 10) 0 0 


Therefore, for s > s) we can take the limit as N > ov, 


k 
(S$ — 8) 
Therefore, the comparison test for improper integrals implies that the Laplace transform 
Lif] exists at least for s > 59, and it also holds that 


k 
|L[f]| < ; S> 8. 


S— Sp 


|L[f]| < lim 


N-oo 


i‘. e~** F(t) at| <k Lie**| = 


This establishes the Theorem. 

The next result says that the Laplace transform is a linear transformation. This means 
that the Laplace transform of a linear combination of functions is the linear combination of 
their Laplace transforms. 
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Theorem 4.1.4 (Linearity). If L[f] and Lig] exist, then for alla, b € R holds 
Liaf + bg] =aLllf] + bL{g]. 


Proof of Theorem 4.1.4: Since integration is a linear operation, so is the Laplace trans- 
form, as this calculation shows, 


Liaf + bg] = a e [af (t) + bg(t)] dt 


0) 


af espns [em g(oat 
=aL|f] + bL{g]. 


This establishes the Theorem. 


Example 4.1.5. Compute £[3t? + 5 cos(4t)]. 
Solution: From the Theorem above and the Laplace transform in Table ?? we know that 


L[3t? + 5 cos(4t)] = 3 L[t?] + 5 Licos(4t)| 


2 Ss 
=3(5)+5(aye) #>0 
_ 6 5s 
~ g8 2 4 42° 
Therefore, 
5s* + 6s? + 96 
L|3t? + 5 cos(4t)] = —_~—_—__, s>0. 
| (46)| s3(s? + 16) <J 
The Laplace transform can be used to solve differential equations. The Laplace trans- 
form converts a differential equation into an algebraic equation. This is so because the 
Laplace transform converts derivatives into multiplications. Here is the precise result. 


Theorem 4.1.5 (Derivative into Multiplication). If a function f is continuously differen- 
tiable on [0,co) and of exponential order so, then Li f'] exists for s > 5) and 


Lif']=sL[f]—f(0), $s > so. (4.1.4) 


Proof of Theorem 4.1.5: The main calculation in this proof is to compute 
N 


hy : —st gl 
els) = im, fe" fae 
We start computing the definite integral above. Since f’ is continuous on [0, 00), that definite 
integral exists for all positive N, and we can integrate by parts, 


[ etroa= [err |? - [ corerststo ae 


=e F(N) - flO) +5 f e * f(t) dt. 


10) 
We now compute the limit of this expression above as N — oo. Since f is continuous on 
[0, co) of exponential order s9, we know that 


N 
slim, ; e"fHd=Llfl, S > So. 
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Let us use one more time that f is of exponential order sy. This means that there exist 
positive constants k and T such that |f(t)| < ke%°*, for t > T. Therefore, 


lim e*" f(N) < lim ke~*%e*¥ = lim ke 8-80)" — 9, S > 8. 
N—- 00 N—-0o N—- 00 


These two results together imply that L[f’] exists and holds 
L[f'}=sLlf]—f(0), > S0. 


This establishes the Theorem. 


Example 4.1.6. Verify the result in Theorem 4.1.5 for the function f(t) = cos(0t). 


Solution: We need to compute the left hand side and the right hand side of Eq. (4.1.4) and 
verify that we get the same result. We start with the left hand side, 


2 
Lif’] = £L[—bsin(bt)] = —b L[sin(bt)] = —d x => Lif'|= -aH 
We now compute the right hand side, 
8 s* — s* — p? 
sLif|— f(0) = sL{cos(bt)] -1=s Pa 1= aR 
so we get 
b2 
sL[f] — f(0) = era y 
We conclude that L[f’] = s Lif] — f(0). <J 


It is not difficult to generalize Theorem 4.1.5 to higher order derivatives. 


Theorem 4.1.6 (Higher Derivatives into Multiplication). If a function f is n-times con- 
tinuously differentiable on [0,00) and of exponential order so, then L[f”],--» , Lf] exist 
for s > 8 and 


£[f”] = 8° Lif] — s f(0) — f'(0) (4.1.5) 


LIF] = 8” £[f] — 8-9 FQ) —--- — FOYE). (4.1.6) 


Proof of Theorem 4.1.6: We need to use Eq. (4.1.4) n times. We start with the Laplace 
transform of a second derivative, 


The formula for the Laplace transform of an nth derivative is computed by induction on n. 
We assume that the formula is true for n — 1, 


per er a ae) ace), 
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Since L[f™] = L[(f’)-Y], the formula above on f’ gives 


LUFYOY] = aD LIF] — 8 F(0) —--- — FO (0) 
= 8) (sL£[F] — £(0)) — 8 (0) = = F-(0) 
Se Lif a FO) 9) FO) ae 0), 


This establishes the Theorem. 


Example 4.1.7. Verify Theorem 4.1.6 for f”, where f(t) = cos(bt). 


Solution: We need to compute the left hand side and the right hand side in the first 
equation in Theorem (4.1.6), and verify that we get the same result. We start with the left 
hand side, 


MW) py _p2 — _p2 — _p2 8 i bes 
Lif"| = L[—0* cos(bt)] = —b° L[cos(bt)] = —b PaP = Lc |= Pa 
We now compute the right hand side, 
3 _ 3 _ p2 
3 7 a ee 2 Ss a =e — b*s 
s' Lif] —s f(0) — f'(0) = s* L[cos(bt)] - s -O=s 212 2a 
so we get 
2 i bs 
s* Lif] —s f(0) — f'(0) = he 
We conclude that Lif”) = s? Lf] — s f(0) — f’(0). <J 


The Laplace transform also satisfies a converse to Theorem 4.1.5, since multiplications 
can be transformed into derivatives. 


Theorem 4.1.7 (Multiplication into Derivative). If a function f is of exponential order 85 
with a Laplace transform F(s) = Lif ()], then Lit f(t)] exists for s > 8) and 


Lit f(t)}=—F"(s), 8 > 5. (4.1.7) 


Proof of Theorem 4.1.7: From the definition of the Laplace Transform we see that 


This establishes the Theorem. 
The result in Theorem 4.1.7 can be generalized to higher powers. 


Theorem 4.1.8 (Higher Powers into Derivative). If a function f is of exponential order 85 
with a Laplace transform F(s) = Lif (t)], then Lit” f(t)] exists for s > 5) and 


Lit” f(t)] = (-1)"F™ (s), o> Bp, (4.1.8) 


d”™ 
where we denoted F™ = aa 
gt 
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Proof of Theorem 4.1.8: We use induction one more time. The case n = 1 is done in 
Theorem 4.1.7. We now assume that 
nm n d” 
Lie" FO] = (-)" FO), 


and we try to show that a similar formula holds for n+ 1. But this is the case, since 


LEY FH] = Lert) 
= (1) ce 0) 


since t f(t) satisfies the hypotheses in Theorem 4.1.7, since f(t) does. Then we use Theorem 
4.1.7 one more time, 


= (yr SS 


q+) 
—ypjy(t) 2 
( 1) qenay Et ? 


(jee (s). 


Lif), 


I 


This establishes the Theorem. 


4.1.4. Solving Differential Equations. The Laplace transform can be used to solve 
differential equations. We Laplace transform the whole equation, which converts the differ- 
ential equation for y into an algebraic equation for £L[y]. We solve the Algebraic equation 
and we transform back. 


: ; . Solve the Transform back 
differential (1) Algebraic (2) ; (3) : 
Bei algebraic eh to obtain y. 
eq. for Ly]. 


eq. for y. 
ee eq. for Ly]. (Use the table.) 


Example 4.1.8. Use the Laplace transform to find y solution of 
y"+9y=0, (0) =4, y'(0) =H. 


Remark: Notice we already know what the solution of this problem is. Following § 2.3 we 
need to find the roots of 


pr)=r?+9 => r= +38, 

and then we get the general solution 
y(t) = c, cos(3t) + c_ sin(3t). 

Then the initial condition will say that 

y(t) = yo cos(3t) + 5 sin(3t). 
We now solve this problem using the Laplace transform method. 
Solution: We now use the Laplace transform method: 

Lily” + 9y] = L[0] = 0. 

The Laplace transform is a linear transformation, 


Lly"| + 9L[y] = 0. 


4.1. INTRODUCTION TO THE LAPLACE TRANSFORM 185 


But the Laplace transform converts derivatives into multiplications, 
s’ Lly| — sy(0) — y'(0) + 9 Ly] = 0. 
This is an algebraic equation for £[y]. It can be solved by rearranging terms and using the 
initial condition, 
8 1 


(s? +9) Ly] = $Y +m = Ly] = yo (s? +9) + Ys (s2 +9) 


But from the Laplace transform table we see that 


S . 3 
pag = fine] - Se 


L[cos(3t)] = 
therefore, 
Lly] = Yo L{cos(3t)] + y: ; L{sin(3t)]. 
Once again, the Laplace transform is a linear transformation, 
Lly| = Ll cos(3t) + > sin(3t) |. 


We obtain that 
y(t) = Yo cos(3t) + sin(3t). 
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4.1.5. Exercises. 


4.1.1. 
(a) 


(b) 


4.1.2.- 


4.1.4.- 


Compute the definite integral 


N 
dx 
el = 


Use the result in (a) to compute 


Compute the definite integral 


N 
Ty = edt. 


Use the result in (a) to compute 
i= ‘f e* dt. 


Compute the definite integral 


N 
w= 
0 


Use the result in (a) to compute 
F(s) = Lle**]. 


Indicate the domain of F’. 


—st 2t 
ee dt. 


Compute the definite integral 


N 
v= f ete dt. 
0 


Use the result in (a) to compute 
F(s) = Lite~*"]. 


Indicate the domain of F’. 


4.1.5.- 


4.1.7.- 


ETHOD 


Compute the definite integral 
N 
In = / e ** sin(2t) dt. 
ty) 


Use the result in (a) to compute 
F(s) = L{sin(2¢)]. 


Indicate the domain of F’. 


Compute the definite integral 


N 
In = / e** cos(2t) dt. 
0 
Use the result in (a) to compute 
F(s) = L{cos(2¢)]. 


Indicate the domain of F’. 


Use the definition of the Laplace 


transform to compute 


F(s) = L{sinh(at)], 


and indicate the domain of F’. 


4.1.8.- 


* Use the definition of the Laplace 


transform to compute 


F'(s) = L{cosh(at)], 


and indicate the domain of F’. 
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4.2. The Initial Value Problem 


We will use the Laplace transform to solve differential equations. The main idea is, 


. . . Solve the Transform back 
differential eq. (1) Algebraic eq. (a) ; (3) : 
4 —-5 algebraic eq. —5 to obtain y(t). 
for y(t). for Lly(t)]. 
for Liy(t)]. (Use the table.) 


We will use the Laplace transform to solve differential equations with constant coeffi- 
cients. Although the method can be used with variable coefficients equations, the calcula- 
tions could be very complicated in such a case. 

The Laplace transform method works with very general source functions, including step 
functions, which are discontinuous, and Dirac’s deltas, which are generalized functions. 


4.2.1. Solving Differential Equations. As we see in the sketch above, we start with 
a differential equation for a function y. We first compute the Laplace transform of the whole 
differential equation. Then we use the linearity of the Laplace transform, Theorem 4.1.4, and 
the property that derivatives are converted into multiplications, Theorem 4.1.5, to transform 
the differential equation into an algebraic equation for £L[y]. Let us see how this works in a 
simple example, a first order linear equation with constant coefficients—we already solved 
it in § 1.1. 


Example 4.2.1. Use the Laplace transform to find the solution y to the initial value prob- 
lem 


yi +2y=0, = y(0) =3. 


Solution: In § 1.1 we saw one way to solve this problem, using the integrating factor method. 
One can check that the solution is y(t) = 3e~7’. We now use the Laplace transform. First, 
compute the Laplace transform of the differential equation, 


Lily’ + 2y|] = £L[0] = 0. 
Theorem 4.1.4 says the Laplace transform is a linear operation, that is, 
Lly’] +2Lfy] = 0. 


Theorem 4.1.5 relates derivatives and multiplications, as follows, 


(sly]—y(0)) +2£p)=0 + (5 +2)L[y] = (0), 


In the last equation we have been able to transform the original differential equation for y 
into an algebraic equation for L[y]. We can solve for the unknown L[y] as follows, 


ety] = 4 ~ l= 
) 


where in the last step we introduced the initial condition y(0) = 3. From the list of Laplace 
transforms given in §. 4.1 we know that 


1 = 3 
s—a s+2 


3 
_ = = =3t 
=3Lle"] = Pew aca ie 


So we arrive at L[y(t)] = £[3e~7]. Here is where we need one more property of the Laplace 
transform. We show right after this example that 


Lly(t)]) =L£[Be"*] => y(t) =3e*. 


This property is called one-to-one. Hence the only solution is y(t) = 3e~ 7°. <J 


Lle™] — 
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4.2.2. One-to-One Property. Let us repeat the method we used to solve the differ- 
ential equation in Example 4.2.1. We first computed the Laplace transform of the whole 
differential equation. Then we use the linearity of the Laplace transform, Theorem 4.1.4, and 
the property that derivatives are converted into multiplications, Theorem 4.1.5, to trans- 
form the differential equation into an algebraic equation for L[y]. We solved the algebraic 
equation and we got an expression of the form 


Lly(t)] = H(s), 


where we have collected all the terms that come from the Laplace transformed differential 
equation into the function H. We then used a Laplace transform table to find a function h 
such that 


Clearly, y = fh is one solution of the equation above, hence a solution to the differential 
equation. We now show that there are no solutions to the equation L[y] = £L[h] other than 
y = h. The reason is that the Laplace transform on continuous functions of exponential 
order is an one-to-one transformation, also called injective. 


Theorem 4.2.1 (One-to-One). If f, g are continuous on [0,00) of exponential order, then 


Lifi=£lol = f=9. 


Remarks: 


(a) The result above holds for continuous functions f and g. But it can be extended to 
piecewise continuous functions. In the case of piecewise continuous functions f and g 
satisfying £L[f] = £L[g] one can prove that f = g +h, where h is a null function, meaning 
that i h(t) dt = 0 for all T > 0. See Churchill’s textbook [4], page 14. 

(b) Once we know that the Laplace transform is a one-to-one transformation, we can define 
the inverse transformation in the usual way. 


Definition 4.2.2. The inverse Laplace transform, denoted L—', of a function F is 


LF(s)=f@Q) @ F(s) =f). 


Remarks: There is an explicit formula for the inverse Laplace transform, which involves 
an integral on the complex plane, 
1 a+ic 
CF (s)|| = — lin e* F'(s) ds. 


t 21 cro Jie 


See for example Churchill’s textbook [4], page 176. However, we do not use this formula in 
these notes, since it involves integration on the complex plane. 


Proof of Theorem 4.2.1: The proof is based on a clever change of variables and on 
Weierstrass Approximation Theorem of continuous functions by polynomials. Before we get 
to the change of variable we need to do some rewriting. Introduce the function u = f — g, 
then the linearity of the Laplace transform implies 


Llu] = Lif — 9] = Lf] — Lig] = 0. 
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What we need to show is that the function u vanishes identically. Let us start with the 
definition of the Laplace transform, 


We know that f and g are of exponential order, say s9, therefore u is of exponential order 
S 9, meaning that there exist positive constants & and T such that 


| u(t) | <he, t>T. 


Evaluate £L[u] at § = s;+n-+1, where s, is any real number such that s,; > s9, and n is any 
positive integer. We get 


=i eter tntl)t u/s) a= [ e 8tt (MED a(t) dt. 


5 0 0 


We now do the substitution y = e~*, so dy = —e~* dt, 
10) 1 
Liul)_ - / y** y” u(—In(y)) (—dy) = ‘| y** y" u(—In(y)) dy. 
3 1 0 


Introduce the function u(y) = y*! u((— In(y)), so the integral is 


| y” v(y) dy. (4.2.1) 


We know that L[u] exists because u is continuous and of exponential order, so the function 
v does not diverge at y = 0. To double check this, recall that t = —In(y) > co as y > 0*, 
and u is of exponential order s), hence 


Llu] 


‘ a —sit F —(si-so)t — 
ue |u(y)| = jim e™**"{u(t)| < lim e 0. 


Our main hypothesis is that £[u] = 0 for all values of s such that £[u] is defined, in particular 
8. By looking at Eq. (4.2.1) this means that 


1 
| reway=o, n=1,2,3,---. 
10) 


The equation above and the linearity of the integral imply that this function v is perpen- 
dicular to every polynomial p, that is 


1 
/ ply) u(y) dy = 0, (4.2.2) 
0 
for every polynomial p. Knowing that, we can do the following calculation, 
1 1 1 
/ v*(y) dy = : (v(y) — p(y) vy) dy +f p(y) v(y) dy. 
0 10) 10) 


The last term in the second equation above vanishes because of Eq. (4.2.2), therefore 


< max jv(o)|_f v(u) — Plo) dy. (4.2.3) 


We remark that the inequality above is true for every polynomial p. Here is where we use the 
Weierstrass Approximation Theorem, which essentially says that every continuous function 
on a closed interval can be approximated by a polynomial. 
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Theorem 4.2.3 (Weierstrass Approximation). If f is a continuous function on a closed 
interval [a,b], then for every « > 0 there exists a polynomial q. such that 


max | f(y) — de(y)| <e. 
ye [a,b] 


The proof of this theorem can be found on a real analysis textbook. Weierstrass result 
implies that, given v and e€ > O, there exists a polynomial p, such that the inequality 
in (4.2.3) has the form 


Z 1 
7 v*(y) dy < max |v(y)| / |v(y) — pe(y)| dy < max |u(y)|e. 
0 y€[0,1] 0 y€[0,1] 


Since € can be chosen as small as we please, we get 


[Pwa=o 


But v is continuous, hence v = 0, meaning that f = g. This establishes the Theorem. 


4.2.3. Partial Fractions. We are now ready to start using the Laplace transform to 
solve second order linear differential equations with constant coefficients. The differential 
equation for y will be transformed into an algebraic equation for L[y]. We will then arrive 
to an equation of the form L[y(t)] = H(s). We will see, already in the first example below, 
that usually this function H does not appear in Table 1. We will need to rewrite H as a 
linear combination of simpler functions, each one appearing in Table 1. One of the more 
used techniques to do that is called Partial Fractions. Let us solve the next example. 


Example 4.2.2. Use the Laplace transform to find the solution y to the initial value prob- 
lem 


y"—y'-2y=0, yO=1, — y/(0) =0. 

Solution: First, compute the Laplace transform of the differential equation, 

Lly” — y' — 2y] = £[0] = 0. 
Theorem 4.1.4 says that the Laplace transform is a linear operation, 

Lly"| — Lly’] — 2L[y] = 0. 
Then, Theorem 4.1.5 relates derivatives and multiplications, 

[s? Llu] - sy(0) — v'(0)] — [s Clu] - y(0)] - 2£1y] =0, 
which is equivalent to the equation 
(s? — s — 2) Lily] = (s— 1) y(0) + y'(0). 


Once again we have transformed the original differential equation for y into an algebraic 
equation for £[y]. Introduce the initial condition into the last equation above, that is, 


(s? —s— 2) Lly] = (s — 1). 


Solve for the unknown L[y] as follows, 
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The function on the right hand side above does not appear in Table 1. We now use partial 
fractions to find a function whose Laplace transform is the right hand side of the equation 
above. First find the roots of the polynomial in the denominator, 


soe 3a 0 s, ==[1+V1+8] = { 


84 =2, 


s_=-l, 


that is, the polynomial has two real roots. In this case we factorize the denominator, 


(s—1) 
(s—2)(s +1) 


The partial fraction decomposition of the right-hand side in the equation above is the fol- 
lowing: Find constants a and 6b such that 
(s— 1) a b 


(s—2)\(s+1) s-2 s41 


Lly] = 


A simple calculation shows 


(s — 1) ae b _ as+1)+b(s—2) _ s(a+b) + (a—20) 
(s—2)(s+1) s-2 s41 (s — 2)(s +1) (s — 2)(s +1) 
Hence constants a and b must be solutions of the equations 
a+b=1, 
(s—1)=s(a+b)+(a—2b) => ae 
The solution is a = ; and b= :. Hence, 
1 1 2 1 

“WWl=3 G2 +3 G4) 

From the list of Laplace transforms given in § ??, Table 1, we know that 
1 1 1 
Ee => sag = Fle": a alt 


So we arrive at the equation 


We conclude that 
1 
y(t) = 3 (ec? + 2e7*). 
<q 
The Partial Fraction Method is usually introduced in a second course of Calculus to in- 


tegrate rational functions. We need it here to use Table 1 to find Inverse Laplace transforms. 
The method applies to rational functions 


where P, Q are polynomials and the degree of the numerator is less than the degree of the 
denominator. In the example above 


(s—1) 
(s? —s— 2)’ 
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One starts rewriting the polynomial in the denominator as a product of polynomials degree 
two or one. In the example above, 

(8-1) 
(s—2)(s +1) 
One then rewrites the rational function as an addition of simpler rational functions. In the 
example above, 


R(s) = 


a b 
R(s) = + : 
7) (s—2) (s+1) 
We now solve a few examples to recall the different partial fraction cases that can appear 
when solving differential equations. 


Example 4.2.3. Use the Laplace transform to find the solution y to the initial value prob- 
lem 
y"—4y'+4y=0, y()=1, y(O)=1. 

Solution: First, compute the Laplace transform of the differential equation, 

Liy” — 4y’ + 4y] = L[0] = 0. 
Theorem 4.1.4 says that the Laplace transform is a linear operation, 

Lly"| — 4L[y'] + 4£[y] = 0. 
Theorem 4.1.5 relates derivatives with multiplication, 

[5? Lt] — sy(0) — y'(0)] - 4s Cy] - y(0)] + 4£1y] = 0, 
which is equivalent to the equation 
(s? — 4s + 4) L[y] = (s — 4) y(0) + y/(0). 
Introduce the initial conditions y(0) = 1 and y’(0) = 1 into the equation above, 
(s? — 4s + 4) Lly] = 8 —3. 
Solve the algebraic equation for L[y], 
(s — 3) 


Lly| = ————_... 
lI (s? — 4s + 4) 
We now want to find a function y whose Laplace transform is the right hand side in the 
equation above. In order to see if partial fractions will be needed, we now find the roots of 
the polynomial in the denominator, 


1 
s?—-4s+4=0 = sz = 5 [4+ V16 — 16] > s,=s_=2. 


that is, the polynomial has a single real root, so £[y] can be written as 


(s — 3) 
Lily] = -——. 
[y] (s = 2)2 
This expression is already in the partial fraction decomposition. We now rewrite the right 


hand side of the equation above in a way it is simple to use the Laplace transform table in 
8 2?, 


(s—2)4+2-3 (s — 2) 1 1 1 
L = — L — . 
lw (s — 2) (s—2)2 (s—2) Oo GEoP 
From the list of Laplace transforms given in Table 1, § ?? we know that 
1 1 
Lie™| = > = fle", 


s—a s—2 
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Llte™|] = = ite"), 


(s—ap (6-2 
So we arrive at the equation 

Lly] = Lie] — Lite] =Lle*—te™]) => y(t) =e"*—-te™. 
dq 


Example 4.2.4. Use the Laplace transform to find the solution y to the initial value prob- 
lem 
y —4y'+4y=3e', (0) =0, (0) = 0. 
Solution: First, compute the Laplace transform of the differential equation, 
1 
Lly” — 4y’ + 4y] = L[8e"] = 3 (—). 
$— 
The Laplace transform is a linear operation, 
3 
s—1 
The Laplace transform relates derivatives with multiplication, 


[s? ctu] — (0) - ¥'(0)] — 4 [s Cia] - y(0)] +426) =, 


Lily") — 4L[y'} + 4L[y] = 


But the initial conditions are y(0) = 0 and y’(0) = 0, so 
3 

(s? — 4s + 4) £[y] = ——. 

s—l 


Solve the algebraic equation for L[y], 
3 


Liy| = : 
WW (s — 1)(s? — 4s + 4) 
We use partial fractions to simplify the right-hand side above. We start finding the roots of 
the polynomial in the denominator, 


1 ee 
s*—4s+4=0 sz = 5 [4+ V16 — 16] > sp=s_=2. 
that is, the polynomial has a single real root, so £[y] can be written as 
3 
Lily] = —————... 
Wl = GonjG—» 
The partial fraction decomposition of the right-hand side above is 
3 — a, bste _ a(s—2)?+(bs+c)(s —1) 
(s—1)(s—2)2 (s—1) | (s—2)? — (s —1)(s — 2)? 


From the far right and left expressions above we get 
3=a(s—2)?+ (bs +c)(s—1) =a(s? —48 +4) +bs? —bs+es—c 
Expanding all terms above, and reordering terms, we get 
(a+b) s* + (—4a—b+c)s+(4a—c-3) =0. 
Since this polynomial in s vanishes for all s € R, we get that 
a+b=0, a=3 
—4a-b+c=0, => b=-3 
4da—c—-3=0. c=9. 
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So we get 
c= 3 _ 3 | —3s4+9 
(s—1)(s—2)? s—1  (s—2)? 
One last trick is needed on the last term above, 
Sa+9  —3(s-—2+2)4+9 —3f{s 2) 6+9 _ 3 3 
(27 Gaz (ea (Saye (BD) (SDP 


So we finally get 


3 3 3 
Liy| = : 
Wl = 5-1 2) * 2p 
From our Laplace transforms Table we know that 
1 1 2t 
s—a * s—-2 meh 
1 7 1 
(s—a)? (s— 2)? 


Lile™| a 


Lite] = = £ite”*]. 


So we arrive at the formula 
Lly] = 3 Le") — 3 Le] + 3 Lite*"] = L[3(e' — ee + te”) 
So we conclude that y(t) = 3 (e’ — e + te”). <J 


Example 4.2.5. Use the Laplace transform to find the solution y to the initial value prob- 
lem 
y” — 4y’ + 4y = 3sin(2t), y(0) = 1, y (0) =1. 


Solution: First, compute the Laplace transform of the differential equation, 
Lily” — 4y' + 4y] = L[3 sin(2t))]. 


The right hand side above can be expressed as follows, 


2 6 
Sra O) en aa ee a 


Theorem 4.1.4 says that the Laplace transform is a linear operation, 


6 
m ! = 
fly’) —4£ly'] + 4£yl = Soa) 
and Theorem 4.1.5 relates derivatives with multiplications, 
6 
2 / - = 
[5? Llu] — sy(0) — ¥'(0)] — 4 [Llu] — w(0)] + 41] = 5. 


Reorder terms, 
* — 4s + 4) Lly] = (s — 4) y(0) + y(0) + = —.. 

(5? — 4s +4) Llu] = (5-4) ¥(0) + 9/0) + ag 

Introduce the initial conditions y(0) = 1 and y’(0) = 1, 
6 
2 
(s*° —48+ 4) Lly] =s—-3+ new 

Solve this algebraic equation for L[y], that is, 


(s— 3) 6 
Liy| = . 
Wl = (ia ® * 44 ee TD 
From the Example above we know that s? — 4s + 4 = (s — 2)?, so we obtain 
1 1 6 


cle Gee GaGa. eae) 
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From the previous example we know that 


1 1 
Lie** — te] = — ——x.;. 4.2.5 
(* — te] = 5 - Gp (4.2.5) 
We know use partial fractions to simplify the third term on the right hand side of Eq. (4.2.4). 
The appropriate partial fraction decomposition for this term is the following: Find constants 
a, b, c, d, such that 


6 pf ee, Cc eo d 
(s—2)2(s?4+4) 8244 ° (s—2)  (s—2)? 


Take common denominator on the right hand side above, and one obtains the system 


a+c=0, 
—4a+b-—2c+d=0, 
4a — 4b+ 4c = 0, 

4b — 8c+ 4d = 6. 


The solution for this linear system of equations is the following: 


3 3 3 
a= a b=0, Cae 5: 
Therefore, 
6 3 8 3. 6«d 3 1 
(s—2)2(s2?+4) 85244 8(s—2) | 4 (s—2)? 
We can rewrite this expression above in terms of the Laplace transforms given in Table 1, 
in Sect. ??, as follows, 


and using the linearity of the Laplace transform, 


6 3 eee 
(sd)? (2 +4) = c|= cos(2t) — 3° an zt tl. (4.2.6) 


Finally, introducing Eqs. (4.2.5) and (4.2.6) into Eq. (4.2.4) we obtain 


Lly(t)] = £[(1 — t) e** + : (=1 Soe" + : cos(2t)] ; 


Since the Laplace transform is an invertible transformation, we conclude that 


y(t) = (1-—t)e* + : (= Te 4 : cos(2t). 


<q 


4.2.4. Higher Order IVP. The Laplace transform method can be used with linear 
differential equations of higher order than second order, as long as the equation coefficients 
are constant. Below we show how we can solve a fourth order equation. 


Example 4.2.6. Use the Laplace transform to find the solution y to the initial value prob- 


lem 
y(0)=1, = -y'(0) =9, 
y"(0) =—2, (0) =0. 
Solution: Compute the Laplace transform of the differential equation, 
Lly — 4y] = £{0] = 0. 
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The Laplace transform is a linear operation, 
Lly] — 4L[y] = 0, 
and the Laplace transform relates derivatives with multiplications, 
[54 £ly] — 5° y(0) - s?y'(0) — sy"(0) —y""(0)] -— 4£ly] = 0. 


From the initial conditions we get 


[s* tu] s?-0+2s 0| ALly]=0 = (s*-4)Lly)=s?-28 > Llyl= 


In this case we are lucky, because 


Lly) = s(s* — 2) = Be 
(s? —2)(s?+2)  (s?+2) 
Since é 
L{cos(at)] = Pag 
we get that 


Lly| = Licos(V2t)} = y(t) = cos(V2t). 
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4.2.5. Exercises. 


4.2.1.- . 4.2.2.- . 
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4.3. Discontinuous Sources 


The Laplace transform method can be used to solve linear differential equations with dis- 
continuous sources. In this section we review the simplest discontinuous function—the step 
function—and we use steps to construct more general piecewise continuous functions. Then, 
we compute the Laplace transform of a step function. But the main result in this section 
are the translation identities, Theorem 4.3.3. These identities, together with the Laplace 
transform table in § 4.1, can be very useful to solve differential equations with discontinuous 
sources. 


4.3.1. Step Functions. We start with a definition of a step function. 


Definition 4.3.1. The step function at t = 0 is denoted by u and given by 


Q £<0, 
u(t) = : 6 (4.3.1) 


Example 4.3.1. Graph the step u, u(t) = u(t—c), and u_-(t) = u(t +c), for c > 0. 


Solution: The step function u and its right and left translations are plotted in Fig. 1. 


FIGURE 1. The graph of the step function given in Eq. (4.3.1), a right and 
a left translation by a constant c > 0, respectively, of this step function. 


dq 


Recall that given a function with values f(t) and a positive constant c, then f(t—c) and 
f(t+c) are the function values of the right translation and the left translation, respectively, 
of the original function f. In Fig. 2 we plot the graph of functions f(t) = e*’, g(t) = u(t) e” 
and their respective right translations by c > 0. 


FIGURE 2. The function f(t) = e’, its right translation by c > 0, the 
function f(t) = u(t) e™ and its right translation by c. 


Right and left translations of step functions are useful to construct bump functions. 
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Example 4.3.2. Graph the bump function b(t) = u(t — a) — u(t — 6), where a < b. 


Solution: The bump function we need to graph is 


0 t<a, 
b(t) =u(t—a)—u(t—b) © d(t)=41 ax<t<b (4.3.2) 
0 t>b. 


The graph of a bump function is given in Fig. 3, constructed from two step functions. Step 
and bump functions are useful to construct more general piecewise continuous functions. 


FIGuRE 3. A bump function 6 constructed with translated step functions. 


Example 4.3.3. Graph the function 
f(t) = [ule — 1) — u(é — 2)] e™. 


Solution: Recall that the function 
b(t) = u(t — 1) — u(t — 2), 


is a bump function with sides at t = 1 andf t = 2. 
Then, the function 


f(t) = bt) e™, 
is nonzero where b is nonzero, that is on [{1,2), and 
on that domain it takes values e*’. The graph of 
f is given in Fig. 4. <J 


FIGuRE 4. Function f. 


4.3.2. The Laplace Transform of Steps. We compute the Laplace transform of a 
step function using the definition of the Laplace transform. 


Theorem 4.3.2. For every number c € R and and every s > 0 holds 


ae 


jor ¢c 20, 
Llu(t —c)] = 
: jor c= 0. 


Proof of Theorem 4.3.2: Consider the case c > 0. The Laplace transform is 


Liu(t — c)] = i” e u(t —c)dt = i. e* dt, 
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where we used that the step function vanishes for t < c. Now compute the improper integral, 


—cs —cs 


e 


Cluft—o)] = lim —*(e-¥* — e-) = + pia 


Noo S S Ss 


Consider now the case of c < 0. The step function is identically equal to one in the domain 
of integration of the Laplace transform, which is [0,00), hence 


Llulé—e)] = _ e “u(t — c) dt = : e dt = L[1| = - 


This establishes the Theorem. 


Example 4.3.4. Compute £[3 u(t — 2)]. 
Solution: The Laplace transform is a linear operation, so 


L[3 u(t — 2)] = 3Lfu(t — 2)], 


3 —2s 
and the Theorem 4.3.2 above implies that £[3 u(t — 2)] = . <J 
Remarks: 
(a) The LT is an invertible transformation in the set of functions we work in our class. 
(b) C[fJ=F @ L[F)=f. 
e738 
Example 4.3.5. Compute £7! ‘ 
8 
e738 e738 

Solution: Theorem 4.3.2 says that = Liu(t — 3)], so read | = u(t — 3). <J 

8 


4.3.3. Translation Identities. We now introduce two properties relating the Laplace 
transform and translations. The first property relates the Laplace transform of a translation 
with a multiplication by an exponential. The second property can be thought as the inverse 
of the first one. 


Theorem 4.3.3 (Translation Identities). If L[f(t)](s) exists for s > a, then 
Lilutt—oe)f(t-ol=e KL), s> a, e20 (4.3.3) 
Lie“ f(t)] = LIF) (s — oe), s>at+e, ceER. 


Example 4.3.6. Take f(t) = cos(t) and write the equations given the Theorem above. 


Solution: 
8 Llu(t — c) cos(t — c)] = e~° a 
L{cos(t)] = re ad eae _(s-0) | 
(s—c)? +1" 


Remarks: 


4.3. DISCONTINUOUS SOURCES 201 


(a) We can highlight the main idea in the theorem above as follows: 
Llright-translation (uf)] = (exp) (L[f]), 

L£{(exp) (f)| = translation(L[f]). 

(b) Denoting F'(s) = L[f(t)], then an equivalent expression for Eqs. (4.3.3)-(4.3.4) is 
Liu(t — ec) f(t—e)] =e © F(s), 
Lie“ f(t)] = F(s — ¢). 

(c) The inverse form of Eqs. (4.3.3)-(4.3.4) is given by, 

£L-*e~* F(s)] = u(t —c) f(t—o), 

LF (s—)| =e" fi(e). 


(d) Eq. (4.3.4) holds for all c € R, while Eq. (4.3.3) holds only for c > 0. 
(e) Show that in the case that c < 0 the following equation holds, 


lel 
elult-+ le) fte+ led = el@*(etsco]— fe" seat). 


Proof of Theorem 4.3.3: The proof is again based in a change of the integration variable. 
We start with Eq. (4.3.3), as follows, 


fieete—oi= , e-*tu(t — 0) f(t — 0) dt 
= [yea tT=t-—c, dr=dt, cz>0, 
i: e8(t +) f(r) dt 


ail e °" f(r) dr 
(0) 

=e"Lf), s>a. 
The proof of Eq. (4.3.4) is a bit simpler, since 


cles] = [eres des [oe 9'7@) ae = LUOMs— 0), 


(0) (0) 


which holds for s —c >a. This establishes the Theorem. 


Example 4.3.7. Compute L[u(t — 2) sin(a(t — 2))]. 


Solution: Both L{sin(at)] = za and Liu(t — c) f(t — c)] =e" Lif (t)] imply 
L[u(t — 2) sin(a(t — 2))] = e~?* L[sin(at)] = e~?° a 
We conclude: L[u(t — 2) sin(a(t — 2))] = SS < 


Example 4.3.8. Compute £[e* sin(at)]. 
Solution: Since Lie“ f(t)] = L[f](s — c), then we get 


3t gs = a 
Lie iy sin(at)] = (s—3)? +a?’ 8s > 3; 
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Example 4.3.9. Compute both £[u(t — 2) cos(a(t — 2))] and £Lie** cos(at)]. 


: s 
Solution: Since £[cos(at)] = Pa? then 
—2s : 3t (s— 3) 
Llu(t — 2) cos(a(t — 2))] =e CET Lle** cos(at)| (e-3 +a 


Example 4.3.10. Find the Laplace transform of the function 


0 t<l, 


ft) = { (2-242) t>1 (4.3.7) 


Solution: The idea is to rewrite function f so we can use the Laplace transform Table 1, 
in § 4.1 to compute its Laplace transform. Since the function f vanishes for all t < 1, we 
use step functions to write f as 


f(t) = u(t — 1)(? — 2t+ 2). 
Now, notice that completing the square we obtain, 
?—2t+2= ( —2¢4+1)-14+2=(¢-1)?+1. 
The polynomial is a parabola t? translated to the right and up by one. This is a discontinuous 
function, as it can be seen in Fig. 5. 

So the function f can be written as follows, 
f(t) =u(t-1)(¢-1)? + u(t - 1). 
Since we know that L[t?] = = then 

Eq. (4.3.3) implies 
Lif(t)] = Llu(t — 1) (t- 1)?} + Llu(t — V] 


Ye u(t — 1) [¢-1)? +1] 


ee 
— es 2 esl 
° 8 8 
so we get 
LUf@)] = (2 +8”). FIGuRE 5. Function f given 
‘ a in Eq. (4.3.7). 
e 48 


Example 4.3.11. Find the function f such that L[f(t)] = 


Solution: Notice that 


Lif(t)] = = e~ 48 £fsin(V5t)]. 


But the translation identity 


e @ LIf()] = Llu(t — o) f(t — ©)] 
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implies 


cUf(t)] = = Llu(t — 4) sin(V5 (t — 4))], 
hence we obtain 
fo = = u(t — 4) sin(v5 (t — 4)). 
dq 
Example 4.3.12. Find the function f(t) such that L[f(¢)] = ae 
Solution: We first rewrite the right-hand side above as follows, 
eo ee ee 
L£If(t)] = “@=32 435 
2 = 2) 1 
~ (s—2)243 ° (s—2)?43 
__ 6-9) (1  ¥3 
(s—2)2+ (v3) V3 (s— 2)? + (v3) 
= Lieea/3 Ole 2) 4 5 Lisin(V3t)|(s — 2). 
But the translation identity L[f(t)|(s —c) = L[e“ f (t)] implies 
Lif (t)] = L[e** cos(v3 t)] + sae le sin(V3t)]. 
So, we conclude that 
2t 
fij= a [v3 cos(V3t) + sin(V3¢)], 
dq 
Example 4.3.13. Find £7! ae 
ple 4.3.13. oA’ 
Solution: Since £7’ E * =| = sinh(at) and £~*[e~* f(s)| = u(t —c) f(t —c), then 
26-38 2 2e-3s . 
co ES “| =¢-4 ie a4 sages E : 7 = u(t — 3) sinh(2(t — 3). 
dq 
Example 4.3.14. Find a function f such that L[f(¢)] = — 


Solution: Since the right hand side above does not appear in the Laplace transform Table 
in § 4.1, we need to simplify it in an appropriate way. The plan is to rewrite the denominator 
of the rational function 1/(s?+s—2), so we can use partial fractions to simplify this rational 
function. We first find out whether this denominator has real or complex roots: 


1 


§, =1, 
Sp =- [-1 aE V1+8] => 
2 s_ =-2. 
We are in the case of real roots, so we rewrite 


s?+8s—2=(s—1)(s+2). 
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The partial fraction decomposition in this case is given by 


1 — @ , 6 _ (a+b)s+(2a—5) a+b=0, 
(s—1)(s+2)” (s—1) (8+2)” (s—1)(s+2) 2a—b=1. 
The solution is a = 1/3 and b = —1/3, so we arrive to the expression 
12, 1 1: aye. 
ON ee a ee 
Recalling that 
1 
Lle™] = ; 
s—a 


and Eq. (4.3.3) we obtain the equation 
1 1 
Lif (t)] = 3 Llu(t — 2) e(t2)] <5 L[u(t — 2) aa || 
which leads to the conclusion: 


f= a — 2) jet) _ e129), 
<J 


4.3.4. Solving Differential Equations. The last three examples in this section show 
how to use the methods presented above to solve differential equations with discontinuous 
source functions. 


Example 4.3.15. Use the Laplace transform to find the solution of the initial value problem 
yt2y=ult—4), (0) = 3. 


Solution: We compute the Laplace transform of the whole equation, 


Lly’] + 2 Ly] = Llu(t — 4)] = 


From the previous section we know that 
—4s —As 


[sLly] — y(0)] +2Ly]=—— > (8 +2) Lly] = (0) + 


8 
We introduce the initial condition y(0) = 3 into equation above, 
3 1 
Ly] =— +e" OL = 3L[e “| +e 
NS ay ae) WW aaa 3(s + 2) 


We need to invert the Laplace transform on the last term on the right hand side in equation 
above. We use the partial fraction decomposition on the rational function above, as follows 


1 a b a(s+2)+bs (a+b)s+ (2a) a+b=0, 
s(s +2). (s +2) s(s + 2) ~ s(s + 2) = { 2a = 1. 
We conclude that a = 1/2 and b = —1/2, so 
1 1yl 1 
aa OG ool 


We then obtain 


Lly] = aLle™| + : ie : eo @ : mt 


=3L[e**] + ; (Clute —4)] — £[u(t— 4) ete-9]). 
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Hence, we conclude that 
I j 
y(t) = Se" + = ult — 4) [1 = eXt-4)| 
J 


Example 4.3.16. Use the Laplace transform to find the solution to the initial value problem 


i) 1 O<t<a7 
" / = / = _ SS 


Solution: From Fig. 6, the source function b can be written as 


b(t) = u(t) — u(t — 7). 


FIGURE 6. The graph of the u, its translation and b as given in Eq. (4.3.8). 


The last expression for b is particularly useful to find its Laplace transform, 


L[b(t)] = Liu(t)] — Liule matters i £[p(t)] = (1 e**) = 


Now Laplace transform the whole equation, 
5 
Ly") + Ly!) + 5 fly) = £18). 


Since the initial condition are y(0) = 0 and y’(0) = 0, we obtain 


(3° +s+ °) Lly] = (1 —e7**) : => Ly) =(1-e77) 


Introduce the function 
1 


5 
s (s° +s+ 3) 
That is, we only need to find the inverse Laplace transform of H. We use partial fractions to 


simplify the expression of H. We first find out whether the denominator has real or complex 
roots: 


H(s) = => y(t) =L~"[H(s)] —£"[e“™* A(s)]. 


5 1 
s+st+7=0 ss = 5[-14v1-5], 
so the roots are complex valued. An appropriate partial fraction decomposition is 
1 a (bs +c) 


A = = 
(s) s(s?+s+2 s (s@+s+3 
4 4 


Therefore, we get 
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This equation implies that a, b, and c, satisfy the equations 
5 
a+b=0, at+c=0, geal. 


4 4 4 
The solution is, a 5? b 5? c 5° Hence, we have found that, 


1 aL (s+1) 


a) (#4548) 5 


Complete the square in the denominator, 


5 1 1 1 5 1)? 
s+ 342 = [57 +2/ )s4 + =(s+5) +1. 


2 4 4 4 2 
Replace this expression in the definition of H, that is, 
471 (s+1) 
H(s) = = |; - 
(sta) +4] 


Rewrite the polynomial in the numerator, 


(s+1)=(st5+5)=(st5)+5 
pee ke leg, ay Ne ah a 


hence we get 


1 
471 (s + $) 1 1 
H(s)=3[- war ee 
[(s+3) +] (st) +] 
Use the Laplace transform table to get H(s) equal to 


om = [tt ~ £[e-*!? cos(t)] — sele" «sin(t)] 


equivalently ; 
H(s)= £|= (1 =¢ 4 eps) sin(t)) | 
Denote 
i= [1 — e#? cos(t) — Se? sin(t)]. = Lih(d)). 
Recalling £Liy(t)] = H(s) +e -** H(s), we en Liy(t)] = LiA(t . e “*LIA(t)], that is, 


y(t) = h(t) + u(t — w)h(t — 7). 
J 


Example 4.3.17. Use the Laplace transform to find the solution to the initial value problem 
5 sin(t) O0O<t<7 
” ! =s — / = i co 
yotyt+qzy=gt), yO) =0, 0) =9, gt) = { 0 toa. (43-9) 


Solution: From Fig. 7, the source function g can be written as the following product, 


g(t) = [u(t) — u(t — 7)] sin(t), 
since u(t) — u(t — 7) is a box function, taking value one in the interval [0,7] and zero on 
the complement. Finally, notice that the equation sin(t) = —sin(t — 7) implies that the 
function g can be expressed as follows, 


g(t) = u(t) sin(t) — u(t— 7) sin(t) => — g(t) = u(t) sin(t) + u(t — 7) sin(t — 7). 


The last expression for g is particularly useful to find its Laplace transform, 


4.3. DISCONTINUOUS SOURCES 207 


FIGuRE 7. The graph of the sine function, a square function u(t) — u(t—7) 
and the source function g given in Eq. (4.3.9). 


1 1 
Lig(t)| = ——~ NE 
WOl= Gan te Gran 
With this last transform is not difficult to solve the differential equation. As usual, Laplace 
transform the whole equation, 


Lly'| + Lief] + 2 Lie) = £19} 


Since the initial condition are y(0) = 0 and y’(0) = 0, we obtain 
1 


(s?+5+4) (s241) 


(+542) Lol= (+e) a. ype 


(s? + 1) 
Introduce the function 


H(s) = : 


(s?+5+) (s? +1) 


y(t) = £-"|H(s)| + £-"[e“** H(s)]. 


That is, we only need to find the Inverse Laplace transform of H. We use partial fractions 
to simplify the expression of H. We first find out whether the denominator has real or 
complex roots: 


5 1 
ae ear el st =5[-ltv1-5], 
so the roots are complex valued. An appropriate partial fraction decomposition is 
1 (as + b) (cs + d) 


4Q)= Gps Dea) Stet) Sr 


Therefore, we get 


1= (as +6)(s? +1) + (cs +d)(s? +542), 


equivalently, 


L=(ates+(btetd)s?+(atte+d)s+(b+24). 


This equation implies that a, b, c, and d, are solutions of 


5 
a+c=0, b+c+d=0, at Pet+d=0, b+7d=1. 
Here is the solution to this system: 
1 12 16 4 
ioe b= — ——— d= — 


We have found that, 


_ =| (4s + 3) , (4s +1) 
~~ F (s2 +5 + 3) (s? + 1) 
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Complete the square in the denominator, 
5 1 1 1 5 1\2 
&+s+ = ([s%4+2(5) e+] zt+s=(s4 )o+1. 
4 (4s + 3) (—4s +1) 
(8) = dl 244) G41 | 
(se iyray (+) 
Rewrite the polynomial in the numerator, 


(48+) =4(s+ 5-5) +3=4(s+5) +1, 


hence we get 


H(s) 


4 (s+ 3) 1 8 1 
= 2 2 Sree Teg : 
Wh [(s+2)° +1) [(e+4)° +2) (9? +1) Ge? +1) 

Use the Laplace transform Table in 1 to get H(s) equal to 


His) = = [4c ew cos(t)} + Elie sin(t)] — 4 L[cos(t)] + Lfsin(¢)]| , 
equivalently 
ier E (4e-¥/? cos(t) + e~/? sin(t) — 4 cos(t) + sin(#)) 


Denote 
h(t) = < [sen¥? cos(t) +e"? sin(t) —4eos(t) +sin(t)] = H(s) = £[A(2)) 
Recalling Cy =A) eG), we obtain Lith) SEIN) ee AO), ets, 
it) = 1) +h — 2. 
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4.3.5. Exercises. 


4.3.1.- . 4.3.2.-. 
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4.4. Generalized Sources 


We introduce a generalized function—the Dirac delta. We define the Dirac delta as a limit 
n — co of a particular sequence of functions, {6,}. We will see that this limit is a function 
on the domain R — {0}, but it is not a function on R. For that reason we call this limit a 
generalized function—the Dirac delta generalized function. 

We will show that each element in the sequence {6,,} has a Laplace transform, and this 
sequence of Laplace transforms {L£[d,,]} has a limit as n > oo. We use this limit of Laplace 
transforms to define the Laplace transform of the Dirac delta. 

We will solve differential equations having the Dirac delta generalized function as source. 
Such differential equations appear often when one describes physical systems with impulsive 
forces—forces acting on a very short time but transfering a finite momentum to the system. 
Dirac’s delta is tailored to model impulsive forces. 


4.4.1. Sequence of Functions and the Dirac Delta. A sequence of functions is 
a sequence whose elements are functions. If each element in the sequence is a continuous 
function, we say that this is a sequence of continuous functions. Given a sequence of func- 
tions {y,}, we compute the limp... yn(t) for a fixed t. The limit depends on t, so it is a 
function of t, and we write it as 

Jim yn(t) = y(t). 

The domain of the limit function y is smaller or equal to the domain of the y,,.. The limit 
of a sequence of continuous functions may or may not be a continuous function. 


Example 4.4.1. The limit of the sequence below is a continuous function, 


1 
{ fa(t) = sin((1 + —)t)} > sin(t) as no. 

n 

As usual in this section, the limit is computed for each fixed value of t. <J 
However, not every sequence of continuous functions has a continuous function as a 

limit. 
Example 4.4.2. Consider now the following se- 
quence, {u,}, for n > 1, 


0, t<0 
1 
Un(t) = 4 Ms OSE (4.4.1) 
I, t>—. 
Tr 


This is a sequence of continuous functions whose 
limit is a discontinuous function. From the few 
graphs in Fig. 8 we can see that the limit n — oo 
of the sequence above is a step function, indeed, 
limyp—+oo Un(t) = u(t), where 


{' for t <0, 
u(t) = 
(t) 1 for ¢>0. FIGURE 8. A few func- 


tions in the sequence 


We used a tilde in the name w because this step 
function is not the same we defined in the previous {Un}. 
section. The step u in § 4.3 satisfied u(0) =1. < 


Exercise: Find a sequence {up} so that its limit is the step function u defined in § 4.3. 
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Although every function in the sequence {u,,} is continuous, the limit & is a discon- 
tinuous function. It is not difficult to see that one can construct sequences of continuous 
functions having no limit at all. A similar situation happens when one considers sequences 
of piecewise discontinuous functions. In this case the limit could be a continuous function, 
a piecewise discontinuous function, or not a function at all. 

We now introduce a particular sequence of piecewise discontinuous functions with do- 
main R such that the limit as nm — oo does not exist for all values of the independent variable 
t. The limit of the sequence is not a function with domain R. In this case, the limit is a 
new type of object that we will call Dirac’s delta generalized function. Dirac’s delta is the 
limit of a sequence of particular bump functions. 


Definition 4.4.1. The Dirac delta generalized function is the limit 


6(t) = lim 64,(t), 


noo 


for every fired t € R of the sequence functions {dn} 4, 


bn(t) =n [u(t) — u(t - =)]. (4.4.2) 


The sequence of bump functions introduced above 
can be rewritten as follows, 


0, t<0 

1 

&OQ=< ™ a 
6 t2—. 
nr 


We then obtain the equivalent expression, 


a = {7 for t 40, 


oo for t=0. 


Remark: It can be shown that there exist infin- 
itely many sequences {6,,} such that their limit as 
nm — oo is Dirac’s delta. For example, another 
sequence is 


~ 1 
bn(t) =n [u(t + aa) —u(t— =)| 
1 
0, t<-— 
l 2n 
Sg Se ee FIGURE 9. A few func- 
2n 1 2n tions in the sequence 
0, t> on {on}. 
n 


The Dirac delta generalized function is the function identically zero on the domain 
R — {0}. Dirac’s delta is not defined at t = 0, since the limit diverges at that point. If we 
shift each element in the sequence by a real number c, then we define 


d(t—c) = lim dn(é—), ceER. 
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This shifted Dirac’s delta is identically zero on R — {c} and diverges at t = c. If we shift 
the graphs given in Fig. 9 by any real number c, one can see that 


c+1 
/ bn(t — c) dt = 1 


for every n > 1. Therefore, the sequence of integrals is the constant sequence, {1,1,---}, 
which has a trivial limit, 1, as n - oo. This says that the divergence at t = c of the sequence 
{6,} is of a very particular type. The area below the graph of the sequence elements is always 
the same. We can say that this property of the sequence provides the main defining property 
of the Dirac delta generalized function. 

Using a limit procedure one can generalize several operations from a sequence to its 
limit. For example, translations, linear combinations, and multiplications of a function by 
a generalized function, integration and Laplace transforms. 


Definition 4.4.2. We introduce the following operations on the Dirac delta: 


f(t) 6(t —€) + g(t) 6(t — c) = lim [F(#) dn(t — €) + g(t) bnlt — 0)], 


= 


[u- ous sim, [ult c) dt, 
Li6(t —c)] = lim L[6,(t — c)]. 


noo 


Remark: The notation in the definitions above could be misleading. In the left hand 
sides above we use the same notation as we use on functions, although Dirac’s delta is not 
a function on R. Take the integral, for example. When we integrate a function f, the 
integration symbol means “take a limit of Riemann sums”, that is, 


b n 
. : b—a 
| f(j)dt= pe DA eo Ag, vj =at+ida, Ag = . 


n 


However, when f is a generalized function in the sense of a limit of a sequence of functions 
{fn}, then by the integration symbol we mean to compute a different limit, 


b b 
- f(Q)dt= lim | f(t) dt. 


We use the same symbol, the integration, to mean two different things, depending whether 
we integrate a function or a generalized function. This remark also holds for all the oper- 
ations we introduce on generalized functions, specially the Laplace transform, that will be 
often used in the rest of this section. 


4.4.2. Computations with the Dirac Delta. Once we have the definitions of op- 
erations involving the Dirac delta, we can actually compute these limits. The following 
statement summarizes few interesting results. The first formula below says that the infinity 
we found in the definition of Dirac’s delta is of a very particular type; that infinity is such 
that Dirac’s delta is integrable, in the sense defined above, with integral equal one. 


cte 
Theorem 4.4.3. For every c€ R and € > 0 holds, / d(¢—c)dt =1. 


c=eE 
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Proof of Theorem 4.4.3: The integral of a Dirac’s delta generalized function is computed 
as a limit of integrals, 


_- n+ co cC—eE 


cte cte 
/ d(t—c)dt = lim On (t — c) dt. 


If we choose n > 1/e, equivalently 1/n < €, then the domain of the functions in the sequence 
is inside the interval (c — e«,c + €), and we can write 


c+e e+ 1 
; d(t —c)dt = lim n dt, for —<e 
—_ n-+00 J, n 


Then it is simple to compute 


c+te 1 
/ d(¢—c)dt = lim n(c+——c) = lim 1=1. 
é noo n noo 


This establishes the Theorem. 


The next result is also deeply related with the defining property of the Dirac delta—the 
sequence functions have all graphs of unit area. 


b 
Theorem 4.4.4. If f is continuous on (a,b) and c € (a,b), then | f(t) 6(¢-—c) dt = f(c). 


Proof of Theorem 4.4.4: We again compute the integral of a Dirac’s delta as a limit of 
a sequence of integrals, 


b 

= Jim, n[u(t Cc) u(t c *)] f(t) dt 
c+ 1 

= lim n f (t) dt, ms (6—<), 


To get the last line we used that c € [a,b]. Let F be any primitive of f, so F(t) = f f(t) dt. 
Then we can write, 


b 
/ d(t — c) f(t) dt = lim in F(c+ —) — F(o)] 


a Jim, ay leet *) — F(c)] 


lI 

a 
— 

ie) 
NS 


This establishes the Theorem. 


In our next result we compute the Laplace transform of the Dirac delta. We give two 
proofs of this result. In the first proof we use the previous theorem. In the second proof we 
use the same idea used to prove the previous theorem. 


e ° jor eG, 


Theorem 4.4.5. For all s € R holds £L[d(t — c)] = f gs aac 
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First Proof of Theorem 4.4.5: We use the previous theorem on the integral that defines 
a Laplace transform. Although the previous theorem applies to definite integrals, not to 
improper integrals, it can be extended to cover improper integrals. In this case we get 


e °° for c>0, 


L[d(t —c)] = I ee" §(t—c)dt= i 


for c< 0, 


This establishes the Theorem. 


Second Proof of Theorem 4.4.5: The Laplace transform of a Dirac’s delta is computed 
as a limit of Laplace transforms, 


£[6(t— ¢)] = im, £[ba(t— 2] 


= lim £[n[u(t 0) - u(t -e~ =)]] 
= Jin, [nfo ae Dera 


1 
The case c < 0 is simple. For — < |c| holds 
n 


L[d(t — c)] = lim Odt => Lid(t—c] =0, forsER, c<0. 


n—->co 0 


Consider now the case c > 0. We then have, 


c++ 
= on " —st 
Lid(t — c)] Jim : ne * dt. 
For s = 0 we get 
c++ 
Li[d(t — c)] = lim ndt=1 => L[d(t-—c))=1 fors=0, c 0. 
noo c 
In the case that s 4 0 we get, 
ctt —s 
_ tk —st _ ty n cs ct+)s\ _ cs 4: 1—e n) 
L[a(é—c)] = lim : ne dt = lim ——(e —@ (eran) )=e dim, ©) 
n 


The limit on the last line above is a singular limit of the form 2, so we can use the l’H6pital 


0? 
rule to compute it, that is, 


We then obtain, 
Li[o(t—c)) =e % fors #0, c 0. 
This establishes the Theorem. 


4.4.3. Applications of the Dirac Delta. Dirac’s delta generalized functions describe 
impulsive forces in mechanical systems, such as the force done by a stick hitting a marble. 
An impulsive force acts on an infinitely short time and transmits a finite momentum to the 
system. 
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Example 4.4.3. Use Newton’s equation of motion and Dirac’s delta to describe the change 
of momentum when a particle is hit by a hammer. 


Solution: A point particle with mass m, moving on one space direction, x, with a force F 
acting on it is described by 


ma=F cs ma’ (t) = F(t, w(t) ), 


where x(t) is the particle position as function of time, a(t) = x(t) is the particle acceleration, 
and we will denote v(t) = x’(t) the particle velocity. We saw in § 1.1 that Newton’s second 
law of motion is a second order differential equation for the position function x. Now it is 
more convenient to use the particle momentum, p = mv, to write the Newton’s equation, 


mz =mv' = (mv) =F ee 


So the force F' changes the momentum, P. If we integrate on an interval [t,,t,| we get 


Ap = plts) ~ w(t.) = f° Plta(t)at 


1 


Suppose that an impulsive force is acting on a particle at t) transmitting a finite momentum, 
say Po. This is where the Dirac delta is uselful for, because we can write the force as 


F(t) = po d(t — to), 
then F' = 0 on R— {to} and the momentum transferred to the particle by the force is 
to+At 
Ap= | Do O(t — to) dt = po. 
to—At 


The momentum tranferred is Ap = pp, but the force is identically zero on R— {t,}. We have 
transferred a finite momentum to the particle by an interaction at a single time fo. <J 


4.4.4. The Impulse Response Function. We now want to solve differential equa- 
tions with the Dirac delta as a source. But there is a particular type of solutions that will 
be important later on—solutions to initial value problems with the Dirac delta source and 
zero initial conditions. We give these solutions a particular name. 


Definition 4.4.6. The impulse response function at the point c > 0 of the constant 
coefficients linear operator L(y) = y" + ary’ + aoy, is the solution ys of 


Remark: Impulse response functions are also called fundamental solutions. 


Theorem 4.4.7. The function ys is the impulse response function at c > 0 of the constant 
coefficients operator L(y) =y" +a,y' + aoy iff holds 


ae 


where p is the characteristic polynomial of L. 


Remark: The impulse response function ys at c = 0 satifies 


wae [sy 
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Proof of Theorem 4.4.7: Compute the Laplace transform of the differential equation for 
for the impulse response function y;, 


Lly”| + ar Ly] + a Ly] = L[6(t — c)] =e. 
Since the initial data for ys is trivial, we get 
(s? + a,8 + ao) Lly] =e". 


Since p(s) = s? + ays + dp is the characteristic polynomial of L, we get 
= 


uo =e]. 


All the steps in this calculation are if and only ifs. This establishes the Theorem. 


—cs 


Example 4.4.4. Find the impulse response function at t = 0 of the linear operator 
L(y) = y" + 2y! + 2y. 


Solution: We need to find the solution ys of the initial value problem 
Since the souce is a Dirac delta, we have to use the Laplace transform to solve this problem. 
So we compute the Laplace transform on both sides of the differential equation, 

Llys] + 2L[ys] + 2L[ys] = L[5()}=1 = (8? +28 +2) L[ys] = 1, 
where we have introduced the initial conditions on the last equation above. So we obtain 

1 
Llys| = —— ———.. 

[vs] (s? + 25 + 2) 
The denominator in the equation above has complex valued roots, since 


ee 5[-2+ Vi-8], 


therefore, we complete squares s? + 2s + 2=(s+1)?+ 1. We need to solve the equation 


1 
Llys] = (s+)? 41) 


=Lle'sin(t)]) => ys(t) =e‘ sin(t). 
dq 


Example 4.4.5. Find the impulse response function at t = c > 0 of the linear operator 
Liy) =y" +2y' +2y. 


Solution: We need to find the solution ys of the initial value problem 
yg t2y5+2y5=d(t—c), ys(0)=0, y5(0) =0. 


We have to use the Laplace transform to solve this problem because the source is a Dirac’s 
delta generalized function. So, compute the Laplace transform of the differential equation, 


Llys] + 2L[ys] + 2L[ys] = £[d(t — ©]. 


Since the initial conditions are all zero and c > 0, we get 


—cs 


249542 =e = __, 
(s* + 25+ 2) Lys] =e => Llys| (s? + Ds 3) 


Find the roots of the denominator, 


s?4+2s+2=0 sp = [-2+ 4-8 
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The denominator has complex roots. Then, it is convenient to complete the square in the 
denominator, 


2 
s?+28+2= [s?+2(5) s+] —1+2=(s+1)?+1. 


Therefore, we obtain the expression, 


Recall that L/sin(t)] = 2 ~ 1 and L[f](s — c) = Lie“ f(t)]. Then, 


(s+1241— Lie sin(t)] => Llys] =e" Lle~ sin(¢)]. 


Since for c > 0 holds e“ Li f](s) = Liu(t — c) f(t — c)], we conclude that 
ys(t) = u(t — c)e~*- sin(t — c). 


Example 4.4.6. Find the solution y to the initial value problem 
y" —y=—200(t-3), (0) =1, —-y"(0) = 0. 
Solution: The source is a generalized function, so we need to solve this problem using the 
Lapace transform. So we compute the Laplace transform of the differential equation, 
Lly"| — Lly] = -20c[6(¢-3)] => (s?-1)Lly] -—s=—20e-*, 


where in the second equation we have already introduced the initial conditions. We arrive 
to the equation 


Lily] = an oie" (1) 


which leads to the solution 
y(t) = cosh(t) — 20 u(t — 3) sinh(t — 3). 


= L{cosh(t)| — 20 L[u(t — 3) sinh(¢ — 3)), 


Example 4.4.7. Find the solution to the initial value problem 
y + 4y = 6(t — 7) — 6(t — 2rr), y(0) = 0, y' (0) = 0. 


Solution: We again Laplace transform both sides of the differential equation, 
Lly!"| +4L[y] = Ll6(t -— x)] — L[6(t-—20)]| => (8? +4) Ly) =e"? —e-?"*, 


where in the second equation above we have introduced the initial conditions. Then, 


e77s e727 
L = 
W= Gah rH 
ets 9 e7 27s 2 


2 (s?+4) 2  (s? +4) 
ae [w(e —m) sin[2(t n)]] 


1 
sf [ult 2) sin[2(t — 2n)]] 
The last equation can be rewritten as follows, 


y(t) = ; u(t — m) sin[2(t — 7)] — : u(t — 2m) sin[2(t — 27)], 
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which leads to the conclusion that 


y(t) = = [u(t — m) — u(t — 2m)] sin(2t). 


dq 


4.4.5. Comments on Generalized Sources. We have used the Laplace transform to 
solve differential equations with the Dirac delta as a source function. It may be convenient 
to understand a bit more clearly what we have done, since the Dirac delta is not an ordinary 
function but a generalized function defined by a limit. Consider the following example. 


Example 4.4.8. Find the impulse response function at t = c > 0 of the linear operator 
Ly) =y'. 
Solution: We need to solve the initial value problem 
y(t)=4(t-0), (0) =0. 
In other words, we need to find a primitive of the Dirac delta. However, Dirac’s delta is not 


even a function. Anyway, let us compute the Laplace transform of the equation, as we did 
in the previous examples, 


Lily] =La(t-o)] => slly(t]-yO)=e° = Lly(t)]= 


But we know that 


Looking at the differential equation y’(t) = 6(t — c) and at the solution y(t) = u(t — c) one 
could like to write them together as 


u(t —c) = 6(t—c). (4.4.3) 


But this is not correct, because the step function is a discontinuous function at t = c, hence 
not differentiable. What we have done is something different. We have found a sequence of 
functions u,, with the properties, 


. = -_ . ¥ = a a 
Jim, Un(t — c) = u(t — c), Jim, u,,(t — c) = d(t —c), 


and we have called y(t) = u(t—c). This is what we actually do when we solve a differential 
equation with a source defined as a limit of a sequence of functions, such as the Dirac delta. 
The Laplace transform method used on differential equations with generalized sources allows 
us to solve these equations without the need to write any sequence, which are hidden in the 
definitions of the Laplace transform of generalized functions. Let us solve the problem in 
the Example 4.4.8 one more time, but this time let us show where all the sequences actually 
are. 


Example 4.4.9. Find the solution to the initial value problem 


y (t) = 6(t—c), y(0) = 0, c>0, (4.4.4) 


Solution: Recall that the Dirac delta is defined as a limit of a sequence of bump functions, 


d(t — c) = lim bn(t—¢), dn(t — c) = n[u(t c) u(t c = n=1,2,---. 


nr 
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The problem we are actually solving involves a sequence and a limit, 
y(t) = lim dn(t—c), (0) = 9. 
We start computing the Laplace transform of the differential equation, 
Lly'(t)| = Lf lim dn (t — ©). 
We have defined the Laplace transform of the limit as the limit of the Laplace transforms, 
Lly'(t)] = lim. £{6,(¢ — 0). 
If the solution is at least piecewise differentiable, we can use the property 
Lly'(t)] = sL[y(t)] — y(0). 
Assuming that property, and the initial condition y(0) = 0, we get 


Lly(] =~ tim Llint—o)] > Lly(e)] = Jim “PE 


noo Ss 


Introduce now the function y,(t) = u,(t — c), given in Eq. (4.4.1), which for each n is the 
only continuous, piecewise differentiable, solution of the initial value problem 


Yn(t) =On(t—c), — Yyn(0) = 0. 
It is not hard to see that this function u,, satisfies 
Llu, (t)] = Lion (t — e)) 
8 
Therefore, using this formula back in the equation for y we get, 
Lly(t)] = lim L[un(t)) 
For continuous functions we can interchange the Laplace transform and the limit, 


Lly(t)] = Lf lim un (¢)). 


So we get the result, 
y(t) = lim up(t) => y(t) =u(t—-c). 
noo 
We see above that we have found something more than just y(t) = u(t —c). We have found 
y(t) = im Un(t— c), 
where the sequence elements u,, are continuous functions with u,,(0) = 0 and 
Jim, Un(t — c) = u(t — c), Jim u,,(t —c) = d(t—c), 
Finally, derivatives and limits cannot be interchanged for un, 
s / . / 
tim, [ul (t—c)] # [ lim. Un(t — c)] 

so it makes no sense to talk about y’. <J 

When the Dirac delta is defined by a sequence of functions, as we did in this section, 
the calculation needed to find impulse response functions must involve sequence of functions 
and limits. The Laplace transform method used on generalized functions allows us to hide 


all the sequences and limits. This is true not only for the derivative operator L(y) = y’ but 
for any second order differential operator with constant coefficients. 
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Definition 4.4.8. A solution of the initial value problem with a Dirac’s delta source 
y tay tay=dt—c), y(0)=y, yO=n, (4.4.5) 
where a1, Qo; Yo, ¥1, andc ER, are given constants, is a function 
y(t) = lim yn(t), 
where the functions y,, with n > 1, are the unique solutions to the initial value problems 
Yn +41 Yy +40Yn =5n(t—¢), — Yn(0) = 4, Yn (0) = th, (4.4.6) 
and the source bn, satisfy limp+oo On(t — c) = 6(t — c). 


The definition above makes clear what do we mean by a solution to an initial value problem 
having a generalized function as source, when the generalized function is defined as the limit 
of a sequence of functions. The following result says that the Laplace transform method 
used with generalized functions hides all the sequence computations. 


Theorem 4.4.9. The function y is solution of the initial value problem 
y+arytary=d(t—c), yO)=%, y(O)=H, e209, 
iff its Laplace transform satisfies the equation 


(s? Lly] — syo — y1) + a1 (s Lly] — yo) — ao Lly] =e. 


This Theorem tells us that to find the solution y to an initial value problem when the source 
is a Dirac’s delta we have to apply the Laplace transform to the equation and perform the 
same calculations as if the Dirac delta were a function. This is the calculation we did when 
we computed the impulse response functions. 

Proof of Theorem 4.4.9: Compute the Laplace transform on Eq. (4.4.6), 


Llyn] + as Llyn] + a0 L[Yn] = Ldn (t — ©)].- 

Recall the relations between the Laplace transform and derivatives and use the initial con- 
ditions, 

Llyn] = 8" Llyn] —syo—y, Ly] = 8 Llyn] — yo, 
and use these relation in the differential equation, 

(s? + a8 + do) L[Yn] — 8¥o — Ys — Ayo = L[6n(t — ©)], 
Since 6, satisfies that limy-... dn(t — c) = 6(¢—c), an argument like the one in the proof of 
Theorem 4.4.5 says that for c > 0 holds 


Lilt] = L[5(t-o)] > lim Ll, (t—c)] = e7**. 


n—->oco 


Then 


(s? + 418+) lim Llyn] — sYo — Ys — G1Yo = 
noo 


Interchanging limits and Laplace transforms we get 
(s? + a18 + ao) L[y] — syo — 41 — aio =e, 
which is equivalent to 
(s* Lily] — syo — 41) + a1 (sL[y] — yo) — ao L[y] =e. 
This establishes the Theorem. 


4.4. 


4.4.6. Exercises. 


GI 


ENERALIZED SOURCES 


4.4.1.- * Find the solution to the initial 4.4.2.-. 


value problem 


y” — 8y’ + 16y = cos(rt) 6(t — 1), 
y(0)=0, —-y/(0) =0. 
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4.5. Convolutions and Solutions 


Solutions of initial value problems for linear nonhomogeneous differential equations can be 
decomposed in a nice way. The part of the solution coming from the initial data can be 
separated from the part of the solution coming from the nonhomogeneous source function. 
Furthermore, the latter is a kind of product of two functions, the source function itself and 
the impulse response function from the differential operator. This kind of product of two 
functions is the subject of this section. This kind of product is what we call the convolution 
of two functions. 


4.5.1. Definition and Properties. One can say that the convolution is a general- 
ization of the pointwise product of two functions. In a convolution one multiplies the two 
functions evaluated at different points and then integrates the result. Here is a precise 
definition. 


Definition 4.5.1. The convolution of functions f and g is a function f * g given by 


(f * g(t) = / f(r)g(t — 7) dr. (4.5.1) 


Remark: The convolution is defined for functions f and g such that the integral in (4.5.1) is 
defined. For example for f and g piecewise continuous functions, or one of them continuous 
and the other a Dirac’s delta generalized function. 


Example 4.5.1. Find f * g the convolution of the functions f(t) = e~* and g(t) = sin(t). 


Solution: The definition of convolution is, 
t 
(f*xg)(t) = | e * sin(t — Tr) dr. 
0 
This integral is not difficult to compute. Integrate by parts twice, 
t t 
i e* sin(t —7)dr = Ca cos(t — 7)| _ le“ sin(t — r)| _ / e” sin(t — rT) dr, 
0 10) 
that is, 
t 
2 | e” sin(t—T)dr = le" cos(t — r)| 
0 


We then conclude that 


t t 


10) (0) 


=e * — cos(t) — 0 + sin(t). 


(eave sie" + sin(t) — cos(#)]. (4.5.2) 


Example 4.5.2. Graph the convolution of 
f(r) = u(r) — u(r — 1), 


és 2Qe-77 for 7r>0 
ae 0 for 7 <0. 


Solution: Notice that 


es 2e77 for r<0 
ane 0 for T>0. 
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Then we have that 

2e-) for r<t 
t—r)=g(-(r-t 
et) = GE") f for T>t. 


In the graphs below we can see that the values of the convolution function f * g measure 
the overlap of the functions f and g when one function slides over the other. 
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FIGURE 10. The graphs of f, g, and f * g. 
dq 


A few properties of the convolution operation are summarized in the Theorem below. 
But we save the most important property for the next subsection. 


Theorem 4.5.2 (Properties). For every piecewise continuous functions f, g, and h, hold: 
(i) Commutativity: fx*xg=g*f; 
(ti) Associativity: fei(geh)=(f ¥q)*h; 
(ii) Distributivity: fx(gth)=fxgtfx*h; 
(iv) Neutral element: f «0 = 0; 
(v) Identity element: f *6 = f. 
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Proof of Theorem 4.5.2: We only prove properties (i) and (v), the rest are left as an 


exercise and they are not so hard to obtain from the definition of convolution. The first 
property can be obtained by a change of the integration variable as follows, 


(fx g)(t y= 90) g(t—7)d 


Now introduce the change of variables, 7 = t — 7, which implies dt = —dr, then 


(feg)(t)= | ft—T)g(7)(-) dr 


so we conclude that 
(f * 9)(t) = (9 * f(t). 


We now move to property (v), which is essentially a property of the Dirac delta, 


@= fra 5(t—r) dr = f(t). 


This establishes the Theorem. 


4.5.2. The Laplace Transform. The Laplace transform of a convolution of two func- 
tions is the pointwise product of their corresponding Laplace transforms. This result will 
be a key part in the solution decomposition result we show at the end of the section. 


Theorem 4.5.3 (Laplace Transform). If both Lig] and L[g] exist, including the case where 
either f or g is a Dirac’s delta, then 


L{f +g] = Lif] LIgl.- (4.5.3) 


Remark: It is not an accident that the convolution of two functions satisfies Eq. (4.5.3). 
The definition of convolution is chosen so that it has this property. One can see that this is 
the case by looking at the proof of Theorem 4.5.3. One starts with the expression L[f] £[g], 
then changes the order of integration, and one ends up with the Laplace transform of some 
quantity. Because this quantity appears in that expression, is that it deserves a name. This 
is how the convolution operation was created. 


Proof of Theorem 4.5.3: We start writing the right hand side of Eq. (4.5.1), the product 
Li f|L[g]. We write the two integrals coming from the individual Laplace transforms and 
we rewrite them in an appropriate way. 


cislcial=[f > eetreat] [fatal 


= i eg (f) (f- e*" f(t) dt) di 
=f o@( fo ese at) a 
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where we only introduced the integral in t as a constant inside the integral in t. Introduce 
the change of variables in the inside integral 7 = t+ t, hence dr = dt. Then, we get 


Lif Llal = fe lf) ¢s e" f(r — fdr) di (4.5.4) 
ao a oo) fr — dra. (4.5.5) 


Here is the key step. We must switch the order of 
integration. From Fig. 11 we see that changing the 
order of integration gives the following expression, 


Lif] Lig] = [ [ e*7 g(t) f(r — t) dt dr. 


Then, is straightforward to check that 


eine = fe (fof) ter — atl) ar 


= [rox tioae 


=Li[g*f] => LIfl|Llg] = Cif * gl. 
This establishes the Theorem. 


FIGURE 11. Domain of 
integration in (4.5.5). 


t 
Example 4.5.3. Compute the Laplace transform of the function u(t) = / e” sin(t — 
0 
T) dr. 


Solution: The function u above is the convolution of the functions 
f(t}=e", — g(t) =sin(), 

that is, u= f *g. Therefore, Theorem 4.5.3 says that 
Llu] = Lf * 9] = L[f] Llgl- 


Since, 
Lif\=Lle“}=— 5, Lig] = Lbsin()] = s—, 


we then conclude that L[u] = Lif * g] is given by 


1 
Lif * g] = ————.. 
l= Gye aD 
t 
Example 4.5.4. Use the Laplace transform to compute u(t) = Z e* sin(t — rT) dr. 
0 


Solution: Since u = f * g, with f(t) =e7' and g(t) = sin(t), then from Example 4.5.3, 


Liu] = Lif *g] = ESNCET 
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A partial fraction decomposition of the right hand side above implies that 


Llu] = ate ; 1) us) 


1 1 1 s 
7 scesr “ (s2 41) (s24 5) 
2 5(4le~4 + £{sin(#)] — L{cos(t))). 
This says that 
ioe s(n + sin() — cos(#)). 


So, we recover Eq. (4.5.2) in Example 4.5.1, that is, 


(f * g)(t) = =(e* + sin(t) — cos(t)), 


1 
2 
dq 
t 
Example 4.5.5. Find the function g such that f(t) = / sin(47) g(t—7) dr has the Laplace 
8 ie} 
(s? + 16)((s — 1)? + 9) 


Solution: Since f(t) = sin(4t) * g(t), we can write 


transform L[f] = 


Ss 


(s? + 16)((s — 1)? +9) = L[f] = L[sin(4t) * g(t)] 


= Lsin(4t)| L[g] 


4 
= —— £ 
so we get that 
4 8 1 8 
way = Byipe-ipsy > MG Go pee 
We now rewrite the right-hand side of the last equation, 
1 (s—1+1) 1 (s—1) 1 3 
Li{g| = — ————_ PSL I[g/= 
l= 9 (sy? + o 7 (GoPets Goa 


that is, 


Lio] = 5 (Llcos(3t)](s— 1) + 5£[sin(34)](s — 1) = + (Lle* cos(3e)] + 5 Llet sin(32)]), 


ALE 


which leads us to 
g(t) = ve (cos(3t) + sin(31)) 
4 3 
J 


4.5.3. Solution Decomposition. The Solution Decomposition Theorem is the main 
result of this section. Theorem 4.5.4 shows one way to write the solution to a general initial 
value problem for a linear second order differential equation with constant coefficients. The 
solution to such problem can always be divided in two terms. The first term contains 
information only about the initial data. The second term contains information only about 
the source function. This second term is a convolution of the source function itself and the 
impulse response function of the differential operator. 
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Theorem 4.5.4 (Solution Decomposition). Given constants do, G1, Yo, Y1 and a piecewise 
continuous function g, the solution y to the initial value problem 


yt+aytay=gt), yO=Hm, ¥(O=H, (4.5.6) 


can be decomposed as 


y(t) = yn(t) + (ys * 9)(t), (4.5.7) 
where yp, is the solution of the homogeneous initial value problem 
Yr + 41Yn +40Yn =0, yn(0)= yo, yn(0) = %, (4.5.8) 


and ys is the impulse response solution, that is, 


ys tarys+aoys = 4(t), ys(0)=0, y5(0) =0. 


Remark: The solution decomposition in Eq. (4.5.7) can be written in the equivalent way 


y(t) = yr(t) +f ys(T)g(t — T) dr. 


Also, recall that the impulse response function can be written in the equivalent way 
—cs 1 
Yo =£|—_], c#0, and ys =£>|_|, c=0. 
p(s) p(s) 
Proof of Theorem4.5.4: Compute the Laplace transform of the differential equation, 
Lly"| + a Ly] + a Lly] = Lig()]. 
Recalling the relations between Laplace transforms and derivatives, 
Lily") =s?Lly|—syo—y, Ly] = s Ly] — yo. 
we re-write the differential equation for y as an algebraic equation for L[y], 
(s? + ars + ao) Lly] — sYo — Ys — G1Yo = L[g(t)]- 
As usual, it is simple to solve the algebraic equation for L[y], 
(s +.41)Yo +1 1 
Lly] =7 9 2 
(s?+ais+do) (8? + a18 + do) 
Now, the function y;, is the solution of Eq. (4.5.8), that is, 
(s +41)Yo +41 
(s? + a,8 + do) 
And by the definition of the impulse response solution ys we have that 


1 
7 
vs) (s? + ais + do) 


L{g(t)]. 


Llyn] = 


These last three equation imply, 
Lly] = Llyn] + Llys] Lig). 
This is the Laplace transform version of Eq. (4.5.7). Inverting the Laplace transform above, 


y(t) = yn(t) + £~* [Lys] Lig (t)]). 
Using the result in Theorem 4.5.3 in the last term above we conclude that 


y(t) = yn(t) + (ys * g)(t). 
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Example 4.5.6. Use the Solution Decomposition Theorem to express the solution of 
y" +2y'+2y= g(t), y(0)=1, y(0)=-1. 
Solution: We first find the impuse response function 


Yyo(t) = ieee Fal ; p(s) = s? +2542. 


since p has complex roots, we complete the square, 
s?+2s+2=87?+28+1—-14+2=(s84+1)? +1, 
so we get 


w= Lf i ! => ys(t) =e * sin(t). 


s+4+1)?4+1 

We now compute the solution to the homogeneous problem 
Yn +2y,+2yn=0, yr(0)=1, y,(0) =-1. 
Using Laplace transforms we get 
Lly,] + 2L[y)] + 2L[yn] = 0, 
and recalling the relations between the Laplace transform and derivatives, 
(s? Llyn] — syn (0) — y},(0)) + 2(Liyn) = 8 Lyn] — yn(0)) + 2L[yn] = 0, 

using our initial conditions we get (s? + 2s +2) Llyn] — s+1—2=0, so 
(s+ 1) (s +1) 


Flyn] = (s2+2s+2) (st+124+1’ 


so we obtain 
yn(t) = cle cos(t)] 
Therefore, the solution to the original initial value problem is 


O-rerheah = a= tae. | a gat g—aide 


Example 4.5.7. Use the Laplace transform to solve the same IVP as above. 
y" +2y'+2y= g(t), y(0)=1, y(0)=-1. 
Solution: Compute the Laplace transform of the differential equation above, 
Lly"| + 2L[y'] + 2L[y] = Llo(t)], 
and recall the relations between the Laplace transform and derivatives, 
Lly"| = s? Lly]— sy(0)—y/(0), Ly] = s L[y] — y(0). 

Introduce the initial conditions in the equation above, 

Lly"|=s’Lly]-s(Q)-(-), = Ly] = sLly]-1, 
and these two equation into the differential equation, 

(s? + 2s + 2) Ly] -s+1—2=L[g(t)]. 

Reorder terms to get 


(s+1) 1 
£ly] = (s?+2s+2) ' (s?+2s +2) Llg(t)]. 
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Now, the function yp, is the solution of the homogeneous initial value problem with the same 
initial conditions as y, that is, 


i= (s+ 1) = (s +1) 7 
(s?+2s+2) (s+1)?41 
Now, the function ys is the impulse response solution for the differential equation in this 
Example, that is, 


Lle~* cos(t)}. 


cLlys] = ERED (ae ne a = Lie sin(t)]. 


If we put all this information together and we get 


Lily] = Llyn] + Llys] Lio) = y(t) = yn(t) + (ys * 9) (4), 
More explicitly, we get 


4.5.4. Exercises. 


4.5.1.- . 


4. THE LAPLACE TRANSFORM MI 


ETHOD 


4.5.2.- . 


CHAPTER 5 


Systems of Linear Differential Equations 


Newton’s second law of motion for point particles is one of the first differential equations 
ever written. Even this early example of a differential equation consists not of a single 
equation but of a system of three equation on three unknowns. The unknown functions are 
the particle three coordinates in space as function of time. One important difficulty to solve 
a differential system is that the equations in a system are usually coupled. One cannot solve 
for one unknown function without knowing the other unknowns. In this chapter we study 
how to solve the system in the particular case that the equations can be uncoupled. We call 
such systems diagonalizable. Explicit formulas for the solutions can be written in this case. 
Later we generalize this idea to systems that cannot be uncoupled. 
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5.1. General Properties 


This Section is a generalization of the ideas in § 2.1 from a single equation to a system of 
equations. We start introducing a linear system of differential equations with variable coef- 
ficients and the associated initial value problem. We show that such initial value problems 
always have a unique solution. We then introduce the concepts of fundamental solutions, 
general solution, fundamental matrix, the Wronskian, and Abel’s Theorem for systems. We 
assume that the reader is familiar with the concepts of linear algebra given in Chapter 8. 


5.1.1. First Order Linear Systems. A single differential equation on one unknown 
function is often not enough to describe certain physical problems. For example problems 
in several dimensions or containing several interacting particles. The description of a point 
particle moving in space under Newton’s law of motion requires three functions of time— 
the space coordinates of the particle—to describe the motion together with three differential 
equations. To describe several proteins activating and deactivating each other inside a cell 
also requires as many unknown functions and equations as proteins in the system. In this 
section we present a first step aimed to describe such physical systems. We start introducing 
a first order linear differential system of equations. 


Definition 5.1.1. Ann xn first order linear differential system is the equation 
a(t) = A(t) x(t) + B(t), (5.1.1) 


where then x n coefficient matrix A, the source n-vector b, and the unknown n-vector x are 
given in components by 


ays(t) +++ din(t) b,(t) x1 (E) 
Aw= |: : |, o®=]: |, a®=] : 
Gni(t) +++ Ann(t) b(t) Ln(t) 
The system in 5.1.1 is called homogeneous iff the source vector b = O, of constant coef- 


ficients iff the matriz A is constant, and diagonalizable iff the matrix A is diagonalizable. 


Remarks: 
w(t) 
(a) The derivative of a a vector valued function is defined as a(t) = : 


x, (t) 
(b) By the definition of the matrix-vector product, Eq. (5.1.1) can be written as 


w,(t) = aii(t) y(t) +--+ + Gin(t) tn(t) + bi (8), 


a(t) = Gni(t) v1(¢) +--+ + ann (t) en (t) + bn(t). 
(c) We recall that in § 8.3 we say that a square matrix A is diagonalizable iff there exists 
an invertible matrix P and a diagonal matrix D such that A = PDP™!. 


A solution of an n x n linear differential system is an n-vector valued function a, that 
is, a set of n functions {21,--- ,@p}, that satisfy every differential equation in the system. 
When we write down the equations we will usually write x instead of a(t). 


Example 5.1.1. The case n = 1 is a single differential equation: Find a solution x, of 


vy = ar(t) x, + b(t). 
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Solution: This is a linear first order equation, and solutions can be found with the inte- 
grating factor method described in Section 1.2. <J 


Example 5.1.2. Find the coefficient matrix, the source vector and the unknown vector for 
the 2 x 2 linear system 


a — aii (t) L4+ ay2(t) TT git), 
_ = Api (t) 2, + Az0(t) w2 + galt). 


Solution: The coefficient matrix A, the source vector b, and the unknown vector & are, 


A(t) = ies 7 . t= Bel , #i= bet 


azi(t) az2(t) ga(t) a(t) 


Example 5.1.3. Use matrix notation to write down the 2 x 2 system given by 
t= 21 — 2a, 


v= 4,4 2. 
Solution: In this case, the matrix of coefficients and the unknown vector have the form 
_{1 -1 _ [x,(t) 
fe E ‘il ,  of)= Be | 
This is an homogeneous system, so the source vector b = 0. The differential equation can 
be written as follows, 


r= %1— Zo ss | = | 1 7 | eee ee 


7 
Ly = Ly, + Xo 


sel 3 wo=fS). ==(3 


Solution: The 2 x 2 linear system is given by 


FI-E a) E]+be] © oo 
nie 3 1} |2 2e%* |? a, = 3a, + 2, + 2e™. 


dq 


Example 5.1.5. Show that the vector valued functions #) = et and #2) = | et 


are solutions to the 2 x 2 linear system a’ = Aa, where A = ; | ; 


Solution: We compute the left-hand side and the right-hand side of the differential equation 
above for the function «@) and we see that both side match, that is, 


a) — {8 2] [2] a — J4} ot 4 [2] cat. ay _ 2] (at) _ | 2 2t 
Az =(5 i =[ =2),| 2° erly (ce) Si Nae", 
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so we conclude that 2)’ = Av”). Analogously, 


r= ABe Lefer @-Bey=-fe 


so we conclude that 22)’ = Ax). <l 


Example 5.1.6. Find the explicit expression of the most general 3 x 3 homogeneous linear 
differential system. 


Solution: This is a system of the form a’ = A(t) a, with A being a 3 x 3 matrix. Therefore, 
we need to find functions x,, x2, and x3 solutions of 


x, = ai (t) 1 + a12(t) v2 + 413 (t) xs 


vl, = a21(t) v1 + d22(t) 2 + a13(t) xs 


x, = a31(t) M+ a32(t) Lot a33(t) a 


J 


5.1.2. Existence of Solutions. We first introduce the initial value problem for linear 
differential equations. This problem is similar to initial value problem for a single differential 
equation. In the case of an n x n first order system we need n initial conditions, one for 
each unknown function, which are collected in an n-vector. 


Definition 5.1.2. An Initial Value Problem for ann x n linear differential system is 
the following: Given ann x n matrix valued function A, and an n-vector valued function b, 
a real constant tp, and an n-vector X, find an n-vector valued function « solution of 


a = A(t) x+ B(t), tah = as 


Remark: The initial condition vector a represents n conditions, one for each component 
of the unknown vector a. 


Example 5.1.7. Write down explicitly the initial value problem for x = | | given by 


Xy 
Le 
7 — fe fi 3 
zw = Ag, x0) = [3]. A=(5 i 


Solution: This is a 2 x 2 system in the unknowns 2;, x2, with two linear equations 


£, = 2+ 30, 
a, = 32, +20, 
and the initial conditions x,(0) = 2 and x,(0) =3. <J 


The main result about existence and uniqueness of solutions to an initial value problem 
for a linear system is also analogous to Theorem 2.1.2 


Theorem 5.1.3 (Existence and Uniqueness). If the functions A and b are continuous on 
an open interval IC R, and if a is any constant vector and to is any constant in I, then 
there exist only one function x, defined an interval ICI witht € I, solution of the initial 
value problem 


a = A(t)x+ b(t), wa bs) = ty: (5.1.2) 


5.1. GENERAL PROPERTIES 235 


Remark: The fixed point argument used in the proof of Picard-Lindelof’s Theorem 1.6.2 
can be extended to prove Theorem 5.1.3. This proof will be presented later on. 


5.1.3. Order Transformations. There is a relation between solutions to n x n sys- 
tems of linear differential equations and the solutions of n-th order linear scalar differential 
equations. This relation can take different forms. In this section we focus on the case of 
n = 2 and we show two of these relations: the first order reduction and the second order 
reduction. 

It is useful to have a correspondence between solutions of ann xn linear system and an n- 
th order scalar equation. One reason is that concepts developed for one of the equations can 
be translated to the other equation. For example, we have introduced several concepts when 
we studied 2-nd order scalar linear equations in § 2.1, concepts such as the superposition 
property, fundamental solutions, general solutions, the Wronskian, and Abel’s theorem. It 
turns out that these concepts can be translated to 2 x 2 (and in general to n x n) linear 
differential systems. 


Theorem 5.1.4 (First Order Reduction). A function y solves the second order equation 


y” +a,(t) y! + a(t) y = W(t), (5.1.3) 

iff the functions x, = y and x, = y' are solutions to the 2 x 2 first order differential system 
2 Be (5.1.4) 

xl, = —ao(t) 1, — a,(t) x2 + B(t). (5.1.5) 


Proof of Theorem 5.1.4: 
(=) Given a solution y of Eq. (5.1.3), introduce the functions x, = y and x, = y’. Therefore 
Eq. (5.1.4) holds, due to the relation 


vy =y' = 2, 
Also Eq. (5.1.5) holds, because of the equation 
ry =y" =—a(t)y—a(t)y +d(t) = xf = —a(t) a, —a,(t) 2 + O(t). 
(<) Differentiate Eq. (5.1.4) and introduce the result into Eq. (5.1.5), that is, 


xl = x, x = —ao(t) 21 — a(t) x, + b(t). 
Denoting y = 21, we obtain, 


y” +ay(t) y! + ao(t) y = Ot). 


This establishes the Theorem. 


Example 5.1.8. Express as a first order system the second order equation 


y” + 2y’ + 2y = sin(at). 


Solution: Introduce the new unknowns 


/ / 
T= YH; Xo Yy Ly TQ. 


Then, the differential equation can be written as 
x, + 2x, + 2x, = sin(at). 
We conclude that 


e, =a, Ly = —2x, — 2x, + sin(at). 
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Remark: The transformation in Theorem 5.1.4 can be generalized to nxn linear differential 
systems and n-th order scalar linear equations, where n > 2. 


We now introduce a second relation between systems and scalar equations. 


Theorem 5.1.5 (Second Order Reduction). Any 2 x 2 constant coefficients linear system 


= Aa, with x= | , can be written as second order equations for x, and Xz, 
2 


x’ — tr(A) a’ + det(A) x= 0. (5.1.6) 

Furthermore, the solution to the initial value problem af = Ax, with «(0) = a, also solves 
the initial value problem given by Eq. (5.1.6) with initial condition 

cate 2 (0) = Ag. (5.1.7) 


Remark: In components, Eq. (5.1.6) has the form 
zy — tr(A) a, + det(A)z, =0, 
xy — tr (A) x + det(A) rz, = 0. 


First Proof of Theorem 5.1.5: We start with the following identity, which is satisfied by 
every 2 x 2 matrix A, (exercise: prove it on 2 x 2 matrices by a straightforward calculation) 
A? — tr(A) A+ det(A) I =0. 

This identity is the particular case n = 2 of the Cayley-Hamilton Theorem, which holds for 
every n X n matrix. If we use this identity on the equation for a’ we get the equation in 

Theorem 5.1.5, because 
a’ = (Aw) = Ag = A? x= tr(A) Aa — det(A)Ia. 
Recalling that Aw = a’, and Ix = a, we get the vector equation 
x’ — tr(A) a + det(A) x= 0. 


The initial conditions for a second order differential equation are x(0) and a/(0). The first 
condition is given by hypothesis, #(0) = a. The second condition comes from the original 
first order system evaluated at t = 0, that is a/(0) = Aa(0) = Aa. This establishes the 
Theorem. 


Second Proof of Theorem 5.1.5: This proof is based on a straightforward computation. 


Denote A= |“! 12 , then the system has the form 
a21 422 


LL = 11 11 +: 412 2p (5.1.10) 
Uy = 091 Ly + 22 Vp. (5.1.11) 


We start considering the case ajz 4 0. Compute the derivative of the first equation, 


/ / 
L, = 411%, 1 a12 2X5. 


Use Eq. (5.1.11) to replace x4, on the right-hand side above, 
af = a1. 0 + a12(a21 21 + 22 22). 
Since we are assuming that aj2 4 0, we can replace the term with x, above using Eq. (5.1.10), 
(x, — 11 21) 


" / 
LT, = 411%, + 412421 VM, + 412822 a 
12 
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A simple cancellation and reorganization of terms gives the equation, 
xl = (ay, + 22) 2 + (a12@21 — a11422) 24. 
Recalling that tr (A) = ai; + agg, and det(A) = a11422 — ai2G21, we get 
xy — tr(A) x, + det(A) x, = 0. 


The initial conditions for x, are x,(0) and 2/(0). The first one comes from the first compo- 
nent of «(0) = a, that is 

x1(0) = 2. (5.1.12) 
The second condition comes from the first component of the first order differential equation 
evaluated at t = 0, that is a/(0) = Aw(0) = Aap. The first component is 


x (0) = ay Loi + G12 Xoo- (5.1.13) 

Consider now the case a;2 = 0. In this case the system is 

&, =O 2 

Ly = Az, Ly + O92 Lo. 
In this case compute one more derivative in the first equation above, 

ote. 
Now rewrite the first equation in the system as follows 
gq (—2, +1121) = 0. 
Adding these last two equations for x, we get 
xy — ayy @, + a22 (—2' + 411 21) = 0, 

So we get the equation 


” 
Ly — 


(a1, + G22) v7, + (a11422) 41 = 0. 
Recalling that in the case aj2 = 0 we have tr (A) = a1; + aga, and det(A) = ai1422, we get 
zy — tr(A) a, + det(A) 2, = 0. 

The initial conditions are the same as in the case aj2 # 0. A similar calculation gives x, 
and its initial conditions. This establishes the Theorem. 


Example 5.1.9. Express as a single second order equation the 2 x 2 system and solve it, 
x = —2%,+4+ 322, 

ne = %1— Xp. 
Solution: Instead of using the result from Theorem 5.1.5, we solve this problem following 
the second proof of that theorem. But instead of working with x,, we work with x. We start 
computing x, from the second equation: x, = x, + 2,. We then introduce this expression 
into the first equation, 


(eh +22) =—-(ah+%.)+30, => aw +a, =-2),- 2.4 32, 
so we obtain the second order equation 
wl + 2af, — 22, = 0. 


We solve this equation with the methods studied in Chapter 2, that is, we look for solutions 
of the form x(t) = e”*, with r solution of the characteristic equation 


r2+4+2r—-2=0 re ==[-24V448 re =—-1l£ V3. 
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Therefore, the general solution to the second order equation above is 
3c; giver 46 el =e c+, c ER. 


Since x, satisfies the same equation as x2, we obtain the same general solution 


Cc ettv3)t 4 & el =a, G, CER. 


Example 5.1.10. Write the first order initial value problem 


Pee A=(5 ils o=(*), 0) = [5]. 


as a second order initial value problem for z,. Repeat the calculations for zp. 


Solution: From Theorem 5.1.5 we know that both x, and x2 satisfy the same differential 
equation. Since tr(A) = 1+4=5 and det(A) = 4-6 = —2, the differential equations are 


xi —5a—22,=0, vi, —5a5—-22, =0. 


From the same Theorem we know that the initial conditions for the second order differential 
equations above are 2(0) = a and a/(0) = Aa, that is 


— {ai(0)] — 15 rr, _ | e1(O)] — Jl 2] [5] — 417 
oe eto) = Be a= ee ~ 13 4} [6] — [39] 
therefore, the initial conditions for x, and x, are 


(0) =5, 2 (0)=17, and =e x,(0) =6, 24(0) = 39. 
dq 


5.1.4. Homogeneous Systems. Solutions to a linear homogeneous differential sys- 
tem satisfy the superposition property: Given two solutions of the homogeneous system, 
their linear combination is also a solution to that system. 

Theorem 5.1.6 (Superposition). If the vector functions x), 22) are solutions of 
x)! = Ag), ge = Ag?) 
then the linear combination « = aa + ba), for all a,b € R, is also solution of 


a= Ag. 


Remark: This Theorem contains two particular cases: 
(a) a=b=1: If a) and x) are solutions of an homogencous linear system, so is #) + x), 
(b) b=0 and a arbitrary: If 2 is a solution of an homogeneous linear system, so is aa). 


Proof of Theorem 5.1.6: We check that the function # = ax) + ba) is a solution of 
the differential equation in the Theorem. Indeed, since the derivative of a vector valued 
function is a linear operation, we get 


xe = (axl) + bxl?))' =aa)! + ba", 
Replacing the differential equation on the right-hand side above, 
gt =aAc +b Ax), 
The matrix-vector product is a linear operation, A(az) + ba\?)) =a Ax") +b Ax), hence, 


a = A(az + ba?)) => @g@=Az. 
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This establishes the Theorem. 


Example 5.1.11. Verify that 2) = Hl e-** and #2?) = | e** and a) + 2?) are 


solutions to the homogeneous linear system 


Solution: The function x) is solution to the differential equation, since 


(1)r_ 5 [1] .-2¢ Gj | 2-38] |L) ee 2) oe eg | L) ae 
ot) =a], AB S| eS | | 


We then conclude that a’ = Aw“), Analogously, the function 2) is solution to the 
differential equation, since 


—1 1 —3] /-1 —4 —1 
(2) At (2) _ At At _. At 
xr =al Ag =| ae =[ale =4[ |e. 


We then conclude that #2)’ = Ax’). To show that 2 + 2) is also a solution we could 
use the linearity of the matrix-vector product, as we did in the proof of the Theorem 5.1.6. 
Here we choose the straightforward, although more obscure, calculation: On the one hand, 


! —2Qe—7# — 4e* 
] = eet = |e a. 


—2t 
(1) (2). — |< —e 
w+ £ = 4 elt 


At 


On the other hand, 


A(a® ie a)) at Ee a ea ae “a 


| e72t _ ett _ 3e—2t _ 3e4t 
9 


1 | Je~?7# + e* —3e—7# + 3e4# + e- 7% + ett 
that is, 
—2e~ 2! — Ae*# 
A(a EG a(?)) - ES 4 det 
We conclude that (2) + a)" = A(x) of al?)), “a 


We now introduce the notion of a linearly dependent and independent set of functions. 


Definition 5.1.7. A set of n vector valued functions {x“),--- a} is called linearly 
dependent on an interval I € R iff for allt € I there exist constants c,,--+ , Cn, not all of 
them zero, such that it holds 


ct) (t)4+---+e, a(t) = 0. 


A set of n vector valued functions is called linearly independent on I iff the set is not 
linearly dependent. 


Remark: This notion is a generalization of Def. 2.1.6 from two functions to n vector valued 
functions. For every value of t € R this definition agrees with the definition of a set of linearly 
dependent vectors given in Linear Algebra, reviewed in Chapter 8. 
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We now generalize Theorem 2.1.7 to linear systems. If you know a linearly independent 
set of n solutions to an n x n first order, linear, homogeneous system, then you actually know 
all possible solutions to that system, since any other solution is just a linear combination of 
the previous n solutions. 


Theorem 5.1.8 (General Solution). If {a),--- , a} is a linearly independent set of so- 
lutions of then x n system a’ = Aa, where A is a continuous matrix valued function, then 
there exist unique constants C,,:++ ,Cpn such that every solution x of the differential equation 
xz = Ax can be written as the linear combination 


a(t) = c, & (t) +--+» +e, a (2). (5.1.14) 
Before we present a sketch of the proof for Theorem 5.1.8, it is convenient to state the 
following the definitions, which come out naturally from Theorem 5.1.8. 


Definition 5.1.9. 


(a) The set of functions {x),--- , 2°} is a fundamental set of solutions of the equation 
a = Ax iff the set {x,--- 2} is linearly independent and #! = Ax, for every 
Pe ony, 


(b) The general solution of the homogeneous equation af = Ax denotes any vector valued 
function yey that can be written as a linear combination 


Pcaltl =C; gh) (t) dle se gol Cn gh™) (t), 


where w),--- ,a(™ are the functions in any fundamental set of solutions of a! = Az, 
while c1,+++ ,Cn are arbitrary constants. 


Remark: The names above are appropriate, since Theorem 5.1.8 says that knowing the 
n functions of a fundamental set of solutions is equivalent to knowing all solutions to the 
homogeneous linear differential system. 


Example 5.1.12. Show that the set of functions {a = i e 2, gf) = | ett} is a 


fundamental set of solutions to the system a’ = Ag, where A = E - ‘ 

Solution: In Example 5.1.11 we have shown that 2 and #2) are solutions to the differential 
equation above. We only need to show that these two functions form a linearly independent 
set. That is, we need to show that the only constants c,, c, solutions of the equation below, 
for all t € R, are c, = c, = 0, where 


= —2t  __At 
enad tad na [fersa[G]e= [Ea Z] [5] -x05 
2 


where X(t) = [x (t), 2 (t)] and c= |< . Using this matrix notation, the linear system 
Ps 
for c,, C. has the form 


X(t)e=0. 
We now show that matrix X(t) is invertible for all t € R. This is the case, since its 
determinant is 


det (X(t) =|- =e*+e% =%e" 40 forallteR. 
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Since X(t) is invertible for t € R, the only solution for the linear system above is c = 0, 
that is, c, = co = 0. We conclude that the set fa), gl) \ is linearly independent, so it is a 
fundamental set of solution to the differential equation above. <J 


Proof of Theorem 5.1.8: The superposition property in Theorem 5.1.6 says that given any 
set of solutions {a,--- , (} of the differential equation a/ = A a, the linear combination 
a(t) = c, #4) (t) +--+» +e, a(t) is also a solution. We now must prove that, in the case 
that {a),--- , 2} is linearly independent, every solution of the differential equation is 
included in this linear combination. 

Let «x be any solution of the differential equation a = Aa. The uniqueness statement 
in Theorem 5.1.3 implies that this is the only solution that at t) takes the value a(t,). This 
means that the initial data x(t.) parametrizes all solutions to the differential equation. We 
now try to find the constants {c,,--- , Cn} solutions of the algebraic linear system 


(ty) = cy #) (to) +--+ en @™ (to). 
Introducing the notation 
C4 
x)= [ax (t),+-+ ,a™()] , c= 
Cn 
the algebraic linear system has the form 
x(ty) = X(t) c. 


This algebraic system has a unique solution c for every source a(t,) iff the matrix X (to) 
is invertible. This matrix is invertible iff det(X(to)) # 0. The generalization of Abel’s 
Theorem to systems, Theorem 5.1.11, says that det(X(to)) 4 0 iff the set {a ,--- ,al™} is 
a fundamental set of solutions to the differential equation. This establishes the Theorem. 


Example 5.1.13. Find the general solution to differential equation in Example 5.1.5 and 
then use this general solution to find the solution of the initial value problem 


ad = Ae oo) = |5) A=[} = 


Solution: From Example 5.1.5 we know that the general solution of the differential equation 
above can be written as 


Before imposing the initial condition on this general solution, it is convenient to write this 
general solution using a matrix valued function, X, as follows 


ot) =|" 5,+| [2] & 2 =x@e 


€ Co 
where we introduced the solution matrix and the constant vector, respectively, 
Qe%* et C1 
X(t) = z Qe-t|? c= cI 
The initial condition fixes the vector c, that is, its components c,, C,, as follows, 


(0) = X(0)e = c= [X(0)]~‘a(0). 
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Since the solution matrix X at t = 0 has the form, 
2 1 -1 1/2 -1 
X(0) = ki A = [xX@]~=3 E ar 


introducing [X(0)] ~" in the equation for ¢ above we get 


HL ME) = (5 


We conclude that the solution to the initial value problem above is given by 


se | et 43 | a 


dq 


5.1.5. The Wronskian and Abel’s Theorem. From the proof of Theorem 5.1.8 
above we see that it is convenient to introduce the notion of solution matrix and Wronskian 
of a set of n solutions to an n x n linear differential system, 


Definition 5.1.10. 


(a) A solution matria of any set of vector functions {x“),--- a}, solutions to a dif- 
ferential equation a = Aa, is then x n matrix valued function 


Xe) = [2 @),--- 2!) @)] . (5.1.15) 


Xis called a fundamental matriz iff the set {x ,--- ,a™} is a fundamental set. 
(b) The Wronskian of the set {a),--- , a} is the function W(t) = det (X(t). 


Remark: A fundamental matrix provides a more compact way to write the general solution 
of a differential equation. The general solution in Eq. (5.1.14) can be rewritten as 


C4 C1 
Pgen(t) = ce) (t) +++ + cna (t) = [2O(),--- 2H] |: | =XHe, c= 
Cn Cn 
This is a more compact notation for the general solution, 
Lgen(t) = X(t) e. (5.1.16) 


Remark: The definition of the Wronskian in Def 5.1.10 agrees with the Wronskian of 
solutions to second order linear scalar equations given in Def. 2.1.9, § 2.1. We can see 
this relation if we compute the first order reduction of a second order equation. So, the 
Wronskian of two solutions y;, 2 of the second order equation y” + ayy’ + doy = 0, is 


Yr Ye 
wr V2 
Now compute the first order reduction of the differential equation above, as in Theorem 5.1.4, 


Way = 


‘“ 
Ly — 2, 
/ 
Ly = —ApLy — AyXo. 


The solutions y1, y2 define two solutions of the 2 x 2 linear system, 


een 
YY Yo 
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The Wronskian for the scalar equation coincides with the Wronskian for the system, because 
gyal _ as? a” 
! 1) a?) 


Ny ag 


+ a det ([2™, z)]) =W. 
2 


Example 5.1.14. Find two fundamental matrices for the linear homogeneous system in 
Example 5.1.11. 


Solution: One fundamental matrix is simple to find, we use the solutions in Example 5.1.11, 


e-2t et 
X = [a), 22] = x= [Em “| 


A second fundamental matrix can be obtained multiplying by any nonzero constant each 
solution above. For example, another fundamental matrix is 


X = [ral 30] > xw=| 
dq 


Example 5.1.15. Compute the Wronskian of the vector valued functions given in Exam- 


ple 5.1.11, that is, a = H e~* and 2) = Bi a. 


Solution: The Wronskian is the determinant of the solution matrix, with the vectors placed 
in any order. For example, we can choose the order [x(), 2]. If we choose the order 
[x?), x], this second Wronskian is the negative of the first one. Choosing the first order 
for the solutions, we get 


W(t) = det ([2®, 2] ) 


We conclude that W(t) = 2e7*. <J 


3t -t 
Example 5.1.16. Show that the set of functions ge) = < oe ge?) = © + is 
263 —2e 


linearly independent for all t € R. 


eet et 


Solution: We compute the determinant of the matrix X(t) = so geek 


, that is, 
est e-t 


w(t) = Jest _Qe-t 


= —2e%—2e* => w(t)=—4e*%740 teER. 
dq 


We now generalize Abel’s Theorem 2.1.12 from a single equation to an n x n linear 
system. 


Theorem 5.1.11 (Abel). The Wronskian function W = det(X(t)) of a solution matrix 


X= [a), ee, al”) of the linear system a = A(t)x, where A is ann xn continuous matriz 
valued function on a domain I CR, satisfies the differential equation 
W'(t) = tr [A(d)] W(0). (5417) 


where tr (A) is the trace of A. Hence W is given by 


W(t) = W(t) e%, a(t) = | tr (A(r)) dr. 
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where to is any point in I. 


Remarks: 

(a) In the case of a constant matrix A, the equation above for the Wronskian reduces to 
W(t) = W(t) e* Cay tn) 

(b) The Wronskian function vanishes at a single point iff it vanishes identically for all t € I. 


(c) A consequence of (b): n solutions to the system a = A(t) are linearly independent at 
the initial time t, iff they are linearly independent for every time t € I. 


Proof of Theorem 5.1.11: The proof is based in an identity satisfied by the determinant 
of certain matrix valued functions. The proof of this identity is quite involved, so we do not 
provide it here. The identity is the following: Every n x n, differentiable, invertible, matrix 
valued function Z, with values Z(t) for t € R, satisfies the identity: 

“ det(Z) = det(Z) tr (722). 
We use this identity with any fundamental matrix X = [a vee ai") of the linear homo- 
geneous differential system a/ = Ag. Recalling that the Wronskian w(t) = det(X(t)), the 
identity above says, 

W'(t) = W(#) tr [X~*(t) X’()]. 
We now compute the derivative of the fundamental matrix, 

xX'= [at ol) = [Av),..- Axl”) = AX, 


where the equation on the far right comes from the definition of matrix multiplication. 
Replacing this equation in the Wronskian equation we get 

W'(t) = W(t) tr (X7*AX) = W(t) tr (X X7! A) = W(t) tr (A), 
where in the second equation above we used a property of the trace of three matrices: 
tr (ABC) = tr(CAB) = tr(BCA). Therefore, we have seen that the Wronskian satisfies 
the equation 

W'(t) = tr [A(d)] Wd), 

This is a linear differential equation of a single function W : R > R. We integrate it using 
the integrating factor method from Section 1.2. The result is 


W(t) = W(to) e®™, a(t) = / tr [A(7)] dr. 


to 


This establishes the Theorem. 


Example 5.1.17. Show that the Wronskian of the fundamental matrix constructed with 
the solutions given in Example 5.1.3 satisfies Eq. (5.1.17) above. 
Solution: In Example 5.1.5 we have shown that the vector valued functions a = i et 


1 
3° -2 


and a?) = | e—* are solutions to the system af = Ax, where A = E 5 


. The matrix 


Qe7# et 
X(t) = Ez | 


is a fundamental matrix of the system, since its Wronskian is non-zero, 


=4e'-e® => W(t) =3e'. 


5.1. GENERAL PROPERTIES 245 


We need to compute the right-hand side and the left-hand side of Eq. (5.1.17) and verify 
that they coincide. We start with the left-hand side, 
W’'(t) = 3e° = W(t). 
The right-hand side is 
tr (A) W(t) = (8 — 2) W(t) = W(#). 
Therefore, we have shown that W(t) = tr(A) W(t). <J 
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5.1.6. Exercises. 


5.1.1.- . 5.1.2.- . 
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5.2. Solution Formulas 


We find an explicit formula for the solutions of linear systems of differential equations with 
constant coefficients. We first consider homogeneous equations and later on we generalize 
the solution formula to nonhomogeneous equations with nonconstant sources. Both solution 
formulas for linear systems are obtained generalizing the integrating factor method for linear 
scalar equations used in § 1.1, 1.2. In this section we use the exponential of a matrix, so the 
reader should read Chapter 8, in particular § 8.3, and § 8.4. 

We also study the particular case when the coefficient matrix of a linear differential 
system is diagonalizable. In this case we show a well-known formula for the general solution 
of linear systems that involves the eigenvalues and eigenvectors of the coefficient matrix. To 
obtain this formula we transform the coupled system into an uncoupled system, we solve 
the uncoupled system, and we transform the solution back to the original variables. Later 
on we use this formula for the general solution to construct a fundamental matrix for the 
linear system. We then relate this fundamental matrix to the exponential formula for the 
solutions of a general linear system we found using the integrating factor method. 


5.2.1. Homogeneous Systems. We find an explicit formula for the solutions of first 
order homogeneous linear systems of differential equations with constant coefficients. This 
formula is found using the integrating factor method introduced in § 1.1 and 1.2. 


Theorem 5.2.1 (Homogeneous Systems). [f A is ann x n matria, to € R is an arbitrary 
constant, and & is any constant n-vector, then the initial value problem for the unknown 
n-vector valued function x given by 

od = Aa, x(to) = Xo, 
has a unique solution given by the formula 


a(t) = eAto) gy, (5.2.1) 


Remark: See § 8.4 for the definitions of the exponential of a square matrix. In particular, 
recall the following properties of e4*, for a constant square matrix A and any s,t € R: 


d -1 = 
a ee ee At A, (e4*) ee 


As At _ (A(s+t) 
ee =e : 
dt 


? 


Proof of Theorem 5.2.1: We generalize to linear systems the integrating factor method 
used in § 1.1 to solve linear scalar equations. Therefore, rewrite the equation as a —Ax= 0, 
where 0 is the zero n-vector, and then multiply the equation on the left by e~4*, 

e4ty —eAtAn=O0 = ec 4tae —Ae“*c=0, 


since e~4'A = Ae~“*. We now use the properties of the matrix exponential to rewrite the 
system as 
e Atal 4 (e-4t\'g=-0 = (e742) = 0. 
If we integrate in the last equation above, and we denote by c a constant n-vector, we get 
e“a(t)=e > x(t) =e4*e, 


where we used (e-4*)~? =e". If we now evaluate at t = t) we get the constant vector c, 


D = (to) =e4%e > cHe Aa. 
Using this expression for c in the solution formula above we get 


x(t) = eAteAbog, => x(t) = eA(t—to) a, 
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This establishes the Theorem. 


Example 5.2.1. Compute the exponential function e4* 


valued function x solution to the initial value problem 


=e A=[) i]. 2(0) = a = |). 


and use it to express the vector- 


Solution: The exponential of a matrix is simple to compute in the case that the matrix 
is diagonalizable. So we start checking whether matrix A above is diagonalizable. Theo- 
rem 8.3.8 says that a 2x 2 matrix is diagonalizable if it has two eigenvectors not proportional 
to each other. In oder to find the eigenvectors of A we need to compute its eigenvalues, 
which are the roots of the characteristic polynomial 


_ ese) ee 2 
p(A) = det(A — Al) = 9 i—- (1—A)* —4. 
The roots of the characteristic polynomial are 
(A-1Ps=4 @ Ag =142 Gas. FoSei, 


The eigenvectors corresponding to the eigenvalue ’, = 3 are the solutions v* of the linear 
system (A — 3I2)v' = 0. To find them, we perform Gauss operations on the matrix 


—2 2 1 -1 P ‘ , 1 
ssne[? 25 G+ neg = ve] 
The eigenvectors corresponding to the eigenvalue A. = —1 are the solutions uv" of the linear 
system (A+ Ip)v = 0. To find them, we perform Gauss operations on the matrix 


2 2 1 1 _ 7 —1 
A+h=|) > lo 4 > v=-v, => v= {ql 


Summarizing, the eigenvalues and eigenvectors of matrix A are following, 


a 
Then, Theorem 8.3.8 says that the matrix A is diagonalizable, that is A = PDP~', where 


_ fl -1 _ [3 0 y Lt A 
P= als p= |) af a = ile 


Now Theorem ?? says that the exponential of At is given by 


1 -1])fe# o]1f1 1 
At _ Dt p-1 _ me 
warm tall lo edsla af 


so we conclude that 


4 =3, v(t], and A.=—-1, la 


1 [(e* +e-*)  (e3* — e-*) 
At __ + 
= 5 (ete) fs. (5.2.2) 
Finally, we get the solution to the initial value problem above, 
1 [(e%% +e-*) (et —e-*)] [x 
= fale a. 2 ot 
x(t) = ea = 5) {es ae (e3¢ tiem?) lates’ 


In components, this means 
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5.2.2. Homogeneous Diagonalizable Systems. A linear system a = Az is diago- 
nalizable iff the coefficient matrix A is diagonalizable, which means that there is an invertible 
matrix P and a diagonal matrix D such that A = PDP~!. (See § 8.3 for a review on diag- 
onalizable matrices.) The solution formula in Eq. (5.2.1) includes diagonalizable systems. 
But when a system is diagonalizable there is a simpler way to solve it. One transforms the 
system, where all the equations are coupled together, into a decoupled system. One can solve 
the decoupled system, one equation at a time. The last step is to transform the solution 
back to the original variables. We show how this idea works in a very simple example. 


Example 5.2.2. Find functions 7,, x, solutions of the first order, 2x2, constant coefficients, 
homogeneous differential system 


i 
L, = 21 — Lo, 


/ 
Ly = —-%1 + 2X. 


Solution: As it is usually the case, the equations in the system above are coupled. One 
must know the function x2 in order to integrate the first equation to obtain the function 2. 
Similarly, one has to know function 2; to integrate the second equation to get function x2. 
The system is coupled; one cannot integrate one equation at a time. One must integrate 
the whole system together. 

However, the coefficient matrix of the system above is diagonalizable. In this case the 
equations can be decoupled. If we add the two equations equations, and if we subtract the 
second equation from the first, we obtain, respectively, 


(x1 =F iis) = 0, (x1 = 2)’ = 2(r1 = £2). 


To see more clearly what we have done, let us introduce the new unknowns y, = 7, + Zz, 
and y2 = ©; — Xz, and rewrite the equations above with these new unknowns, 


y=0, y= 2y. 


We have decoupled the original system. The equations for x; and x2 are coupled, but we 
have found a linear combination of the equations such that the equations for y, and y, are 
not coupled. We now solve each equation independently of the other. 


yZ=0 > w=, 


Yo = 2Ye Yo = 2€", 


with c,, c. € R. Having obtained the solutions for the decoupled system, we now transform 
back the solutions to the original unknown functions. From the definitions of y,; and y. we 


see that 
1 1 
t= 3 (us + Ye), a = 5 (Ys — Ya): 
We conclude that for all c,, c. € R the functions x7,, 7, below are solutions of the 2 x 2 


differential system in the example, namely, 


a(t) = =( + ee”), 2(t) = 


J 


The equations for x, and x, in the example above are coupled, so we found an appropri- 
ate linear combination of the equations and the unknowns such that the equations for the 
new unknown functions, y; and y2, are decoupled. We integrated each equation indepen- 
dently of the other, and we finally transformed the solutions back to the original unknowns 
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x, and x. The key step is to find the transformation from 2,, x2 to yi, y2. For general 
systems this transformation may not exist. It exists, however, for diagonalizable systems. 


Remark: Recall Theorem 8.3.8, which says that an n x n matrix A diagonalizable iff A 
has a linearly independent set of n eigenvectors. Furthermore, if \;, v“ are eigenpairs of 
A, then the decomposition A = PDP™! holds for 


P= [v,--- vo], D = diag [Ax,-+- , An] - 


For diagonalizable systems of homogeneous differential equations there is a formula for 
the general solution that includes the eigenvalues and eigenvectors of the coefficient matrix. 


Theorem 5.2.2 (Homogeneous Diagonalizable Systems). If the n x n constant matrix A 


is diagonalizable, with a set of linearly independent eigenvectors fold), vee vl} and corre- 
sponding eigenvalues {X1,-+- ,An}, then the system a’ = Ax has a general solution 
Cent) = ey emt yf) 4... 46, ernt yl, (5.2.3) 
Furthermore, every initial value problem a/(t) = Aa(t), with ato) = a, has a unique 
solution for every initial condition a € R”, where the constants c,,-++ ,Cn are solution of 
the algebraic linear system 
Lo = c er1!0 geil dkioeck Cn e>1t0 ym, (5.2.4) 


Remark: We show two proofs of this Theorem. The first one is just a verification that 
the expression in Eq. (5.2.3) satisfies the differential equation a’ = Aa. The second proof 
follows the same idea presented to solve Example 5.2.2. We decouple the system, we solve 
the uncoupled system, and we transform back to the original unknowns. The differential 
system is decoupled when written in the basis of eigenvectors of the coefficient matrix. 


First proof of Theorem 5.2.2: Each function « = et vo, for i =1,--- ,n, is solution 
of the system a = Az, because 
a! — y, it yl, Age =A (eM vo) = et Ay = ), ty, 
hence a)’ = Aa. Since A is diagonalizable, the set 
{2 (t) = ett oft)... al (t) = ern? wr} 
is a fundamental set of solutions to the system. Therefore, the superposition property says 
that the general solution to the system is 


a(t) = c, ey) 4... +e, Ort ™, 


The constants c,,--+ ,¢, are computed by evaluating the equation above at ¢, and recalling 
the initial condition x(t.) = a. This establishes the Theorem. 


Remark: In the proof above we verify that the functions a) = et uv are solutions, 
but we do not say why we choose these functions in the first place. In the proof below we 
construct the solutions, and we find that they are the ones given in the proof above. 


Second proof of Theorem 5.2.2: Since the coefficient matrix A is diagonalizable, there 
exist an invertible matrix P and a diagonal matrix D such that A = PDP~!. Introduce 
this expression into the differential equation and multiplying the whole equation by P7', 


Pee PDP” ane). 


Notice that to multiply the differential system by the matrix P~! means to perform a very 
particular type of linear combinations among the equations in the system. This is the linear 
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combination that decouples the system. Indeed, since matrix A is constant, so is P and D. 
In particular P~!a/ = (P-12)’, hence 
(Pe =P a), 
Define the new variable y = Pa The differential equation is now given by 
y(t) = Dy(t). 


Since matrix D is diagonal, the system above is a decoupled for the variable y. Transform 
the initial condition too, that is, P~ta(t)) = P~'a , and use the notation y, = P~'a, so 
we get the initial condition in terms of the y variable, 


Y(to) = Yo- 
Solve the decoupled initial value problem y/(t) = D y(t), 
v(t) = AL m(t), y(t) = ae, cet 
= = y=]: 
Y(t) = Xn Yn (0) Un(t) = ene", me 
Once y is found, we transform back to a, 
cert 
a(t) = Py(t) = [v,--- , of] =qe*y)4...46, erty, 
Cn ednt 


This is Eq. (5.2.3). Evaluating it at tp we get Eq. (5.2.4). This establishes the Theorem. 


Example 5.2.3. Find the vector-valued function x solution to the differential system 
a _ {3 _ |i 2 
z=Ag, 0) = [3 A=[} At 


Solution: First we need to find out whether the coefficient matrix A is diagonalizable or 
not. Theorem 8.3.8 says that a 2 x 2 matrix is diagonalizable iff there exists a linearly 
independent set of two eigenvectors. So we start computing the matrix eigenvalues, which 
are the roots of the characteristic polynomial 


7 espa) 2 |g ae 
p(A) = det(A — Alp) = | 9 (1—)| > (1—A)* —4. 
The roots of the characteristic polynomial are 
(A-1P=4 <4 Ag=142 i ae <P 


The eigenvectors corresponding to the eigenvalue ’, = 3 are the solutions v* of the linear 
system (A — 3I2)v' = 0. To find them, we perform Gauss operations on the matrix 


—2 2 1 -1 : " 1 
sonef? oP a = ies = of 
The eigenvectors corresponding to the eigenvalue A. = —1 are the solutions uv" of the linear 
system (A+ I2)v = 0. To find them, we perform Gauss operations on the matrix 


oneB ob y= a > [ih 


2 2 0 O 1 
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Summarizing, the eigenvalues and eigenvectors of matrix A are following, 


+ fl _ -_ |-1 
Az = 3, v= |i], and A. =-1, v= {ql 


Once we have the eigenvalues and eigenvectors of the coefficient matrix, Eq. (5.2.3) gives us 


the general solution 
1 -1 
5 OBE -t 
x(t) =ce li] tee : 


where the coefficients c, and c. are solutions of the initial condition equation 


ee eee 1 -1] fea] _ [3 a] 1f1 1) )3 
Sa’ | al |e 1 tte] =fel @ Jel =2}-1 1] Jal: 
We conclude that c, = 5/2 and c. = —1/2, hence 


_5 3 [1] 1 _, f-1 _ 1 [5e% + e~* 
x(t) => 3 e ll = 3 € 1 = x(t) = 3 Best = ent % 


Example 5.2.4. Find the general solution to the 2 x 2 differential system 


fre — f1 3 
a = Ag, A=(; ile 


Solution: We need to find the eigenvalues and eigenvectors of the coefficient matrix A. But 
they were found in Example 8.3.4, and the result is 


M=4, VO = Hl and A= -2, v= | ; 


With these eigenpairs we construct fundamental solutions of the differential equation, 


= et () 4) — pat JA 
A. = 4, v cl > x’(t) =e ap 
Z —1 z _ —1 
A. = -2, 1 =|] > a(t) =e ala: 


Therefore, the general solution of the differential equation is 


a(t) =c,e*# Hl eae | ‘ cc ER. 
<q 


The formula in Eq. 5.2.3 is a remarkably simple way to write the general solution of the 
equation a = Az in the case A is diagonalizable. It is a formula easy to remember, you 
just add all terms of the form e*! v’, where \;, v' is any eigenpair of A. But this formula 
is not the best one to write down solutions to initial value problems. As you can see in 
Theorem 5.2.2, we did not provide a formula for that. We only said that the constants 
C1,°** ,€m are the solutions of the algebraic linear system in (5.2.4). But we did not write 
the solution for the c’s. It is too complicated in this notation, though it is not difficult to 
do it on every particular case, as we did near the end of Example 5.2.3. 

A simple way to introduce the initial condition in the expression of the solution is with 
a fundamental matrix, which we introduced in Eq. (5.1.10). 
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Theorem 5.2.3 (Fundamental Matrix Expression). If thenxn constant matriz A is diago- 


nalizable, with a set of linearly independent eigenvectors {ylt), vee oi”) } and corresponding 
eigenvalues {Ay,--- ,An}, then, the initial value problem #& = Ax with x(to) = XH has a 
unique solution given by 

a(t) = X(t)X (to) * a (5.2.5) 
where X(t) = [et v),--- ,e%"t y\™)] is a fundamental matrix of the system. 


Proof of Theorem 5.2.3: If we choose fundamental solutions of af = Az to be 
{a (t) = ert? of)... aol (t) = err? wt, 
then the associated fundamental matrix is 
X(t) = [em vi)... ernt yi) , 


We use this fundamental matrix to write the general solution of the differential system as 
Xgen(t) = C1 ety 4... 46, dntyl™ = X(t) ¢, c= 


The equation from the initial condition is now 
B= £(to) = X(t)e > C= X(t) 1a, 


which makes sense, since X(t) is an invertible matrix for all t where it is defined. Using this 
formula for the constant vector c we get, 


a(t) = X()X(ty)—* a. 


This establishes the Theorem. 


Example 5.2.5. Find a fundamental matrix for the system below and use it to write down 
the general solution to the system 


js — fl 2 
z=Aa, A=[) ile 


Solution: One way to find a fundamental matrix of a system is to start computing the 
eigenvalues and eigenvectors of the coefficient matrix. The differential equation in this 
Example is the same as the one given in Example 5.2.3, where we found that the eigenvalues 
and eigenvectors of the coefficient matrix are 


ye fd _ -_ |-1 
A. = 3, v= |i]. and A. =-1, v= [a]. 


We see that the coefficient matrix is diagonalizable, so with the eigenpairs above we can 
construct a fundamental set of solutions, 


(29-8 [] oe FS] 


From here we construct a fundamental matrix 


xo=[fe fe]. 
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Then we have the general solution @gen(t) = X(t)e, where c = | , that is, 
3t =f 
€ =@ Cs 1 _+ |-1 
Sean (t) = és e-t | al ° Dean (t) = Cy et Hl +C.e€ : | | : 


J 


Example 5.2.6. Use the fundamental matrix found in Example 5.2.5 to write down the 
solution to the initial value problem 


= _ _ | Lor _ {1 2 
x = Az, 2{0) = m = |), A=[) if 


Solution: In Example 5.2.5 we found the general solution to the differential equation, 


3t -t 
ev —e Ce 
Lgen(t) = és e-t | fa : 


The initial condition has the form 


f]-<9-x0=-[ IE 


We need to compute the inverse of matrix X(0), 


So the solution to the initial value problem is, 
- e! -e*]1[ 1 1] fz 
a(t) = X(t)X(0)"'e = 2(t)= fs aa | ; E i | "| 


If we compute the matrix on the last equation, explicitly, we get, 


iiss {és eo") = 4 ia 


(e3! —e-*) (ec +e7*)| | a0 


Remark: In the Example 5.2.6 above we found that, for A = | 


xoxo = 3 [eet Ges) 


which is precisely the same as the expression for e4* we found in Eq. (5.2.2) in Example 5.2.2, 


Pa let) de | 


2 


This is not a coincidence. If a matrix A is diagonalizable, then e4(4~) = X(t)X(t.)~!. We 
summarize this result in the theorem below. 
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Theorem 5.2.4 (Exponential for Diagonalizable Systems). Ifannxn matriz A has linearly 
independent eigenvectors { v),--- ul) } with corresponding eigenvalues {\1,--- ,An}, then 
eA to) = X()X (to), 
where X(t) = [e? of)... ,ermt yl]. 


Proof of Theorem 5.2.4: We start rewriting the formula for the fundamental matrix 
given in Theorem 5.2.3, 


emt oo... 0 
X (t) — [wd erre, rey yi) Ant] = [w), evaah v)] : oie . ‘i > 
QO  -++ ernt 
The diagonal matrix on the last equation above can be written as 
emt... 0 
= diag[e™*,--- ,e4*]. 
QO «++ ernt 


If we recall the exponential of a matrix defined in § 8.4, we can see that the matrix above 
is an exponential, since 


diag|e™t* +++ ,e%"| =e", where Dt = diag/Ayt,--- , Ant]. 


One more thing, let us denote P = [v,.-- , v'”)], as we did in § 8.3. If we use these two 
expressions into the formula for X above, we get 
X(t) = Pe. 


Using properties of invertible matrices, given in § 8.2, and the properties of the exponential 
of a matrix, given in § 8.4, we get 


Ali) = (PePt)~" =e Ptop-t, 
where we used that (e? to" =e, These manipulations lead us to the formula 
X(t)X (to) 1 = PePte PP“! 4s X(t)X (tp)! = Pe? —*) p-}, 
Since A is diagonalizable, with A = PDP~!, we known from § 8.4 that 
PeD(t-to) p-1 = eAlt-to) 


We conclude that 
X(t)X(t))71 = AC), 


This establishes the Theorem. 


3 


Example 5.2.7. Verify Theorem 5.2.4 for matrix A = I i and t) = 0. 


Solution: We known from Example 5.2.4 that the eigenpairs of matrix A above are 


= |! —~.9 yo! 
A =4, V =() and A~=-2, ¥v =[yl 


This means that a fundamental matrix for A is 
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This fundamental matrix at t = 0 is 
— fl -l1 eo) oe ee 
X(0) = i a => XxX(0)-= . E | : 
Therefore we get that 


xoxot= [Er Sal]aLa i= a[etes rea) 


On the other hand, e“* can be computed using the formula e4’ = Pe?'P~!, where 


_ f4t 0 Jit =i a Lit 
el 2, Ge), oes Fl, 


Then we get 


so we get 


We conclude that e4“’ = X(t)X(0)~!. 


J 


5.2.3. Nonhomogeneous Systems. The solution formula of an initial value problem 
for an nonhomogeneous linear system is a generalization of the solution formula for a scalar 


equation given in § 1.2. We use the integrating factor method, just as in § 1.2. 


Theorem 5.2.5 (Nonhomogeneous Systems). If A is a constant n x n matrix and b is a 


continuous n-vector function, then the initial value problem 
z(t) = Aa(t) + b(t), tg) = My, 


has a unique solution for every initial condition to € R and a € R” given by 


t 
a(t) = eAl-to) gy, + eAtt-to) / e~ A(t—*0) b(r) dr. 


to 


Remark: Since e*4% are constant matrices, an equivalent expression for Eq. (5.2.6) is 


t 
a(t) = eA) a, + eM / e 47 B(r) dr. 


to 


In the case of an homogeneous system, b = O, we get Eq. (5.2.1). 


(5.2.6) 


Proof of Theorem 5.2.5: We generalize to linear systems the integrating factor method 
used in § 1.2 to solve linear scalar equations. Therefore, rewrite the equation as a —Aax= b, 


and then multiply the equation on the left by e~ 4%, 


e Ate —eAtAga=e 4th = ce Ate — AeA eg =e“ b, 


since e~4¢A = Ae~“*. We now use the formulas for the derivative of an exponential, 


e Atal + (Cas g=e 4th = (e~4¢z)' =e 4p. 


If we integrate on the interval [t,t] the last equation above, we get 


e Ata(t) — e~ 4% x(t.) = / e- AT B(r) dr. 


to 
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At) ee eAt, 


If we reorder terms and we use that (e 


t 
a(t) = ete Aton, + ft f eA b(r) dr. 
to 


At e 7 Ato 


Finally, using the group property of the exponential, e = eA(t—to) | we get 


t 
a(t) = eAlt-to) gy, 4 at | e 47 b(r) dr, 


to 


This establishes the Theorem. 


Example 5.2.8. Find the vector-valued solution x to the differential system 


af = Aa &, 0) =|3} a=[} cle »=[)]. 


Solution: In Example 5.2.3 we have found the eigenvalues and eigenvectors of the coefficient 
matrix, and the result is 


A =3, vd = i , and A=—-l, v= | 


The eigenvectors above say that A is diagonalizable, 


7 e (tal _ [3 0 
A= PDP™, p=[) Hie p=|) els 


We also know how to compute the exponential of a diagonalizable matrix, 


1 -1])fe# o]1f1 1 
At _ Dt p-1 _ a 
wearer tall lo edala af 


so we conclude that 


eft — ! le he. ie Se —At 


2 [(e* — e*) ne) Se 


~s 
The solution to the initial value problem above is, 


l 
x(t) = eta, + ag e 47 bdr. 
0 


Since 


in a similar way 


tipet (ey) Se |= 


9 (e-=" _ e7) (e—37 ak e7) 


Integrating the last expresion above, we get 


: 1 [-e~% — et 1 
—AT as 
jd e€ bdr = 5 ae + a : 
Therefore, we get 


gyn [ere] ier tey eerem pee oel Al 
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Multiplying the matrix-vector product on the second term of the left-hand side above, 
1 [5e%* + e~* 1] 1 [(e%? +e‘) 
a= ) be —e-t|~ lo} 73 (e3! — e~*)| 
We conclude that the solution to the initial value problem above is 


See 4 gH! =a 
z(t) = | 3e3t _ ot | : 


Remark: The formula in Eq. (5.2.6) is also called the variation of parameters formula. The 
reason is that Eq. (5.2.6) can be seen as 
a(t) = a(t) + ap(t) 


where a(t) = e is solution of the homogeneous equation a’ = Aw, and a, is a 
particular solution of the nonhomogeneous equation. One can generalize the variation of 
parameters method to get 2, as follows, 


ap(t) = X(t) u(t), 
where X(t) is a fundamental matrix of the homogeneous system, and wu are functions to be 
determined. If one introduce this z, in the nonhomogeneous equation, one gets 
X'u+Xu =AXu+b 


One can prove that the fundamental matrix satisfies the differential equation X’ = AX. If 
we use this equation for X in the equation above, we get 


AXu+Xu =AXu+b = Xuv=b 


A(t—to) 


so we get the equation 
w=X1b = ult)= fron b(r) dr. 
Therefore, a particular solution found with this method is 
%,(t) = X (£) f X(r)7~1 B(r) dr. 
to 
If we use that X(t.)~1 and X(to) are constant matrices, we get 


p(t) = X(t) [X (to) *X (to) | / X(r)7* b(r) dr 


(0) 


- X(X(t) f X (to)X(r)~* B(r) dr 


= xx) | [X(7)X (to) 71] b(r) dr. 
to 
Now, one can also prove that e4(4‘e) = X(t)X(t))~! for all n x n coefficient matrices, no 


just diagonalizable matrices. If we use that formula we get 


t 
Zp(t) = a e Arto) B(7) dr. 
to 
So we recover the expression in Eq. (5.2.6) for «© = a, + a. This is why Eq. (5.2.6) is also 

called the variation of parameters formula. 
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5.2.4. Exercises. 


5.2.1.- Use the exponential formula in 
Eq. (5.2.1) to find the solution of the 
initial value problem 


g =Azx, 2x(0)=2 


5 —2 
A= ? sie 
5.2.2.- Use the exponential formula in 


Eq. (5.2.1) to find the solution of the 
initial value problem 


where 


g =Azx, 2x(0) = 2 


where 


5.2.3.- * Follow the proof of Theorem 5.2.2 
to find the general solution of the sys- 
tem 

g =Azx, 2x(0) = 2 


7 -2 
ion he = 
(a) Find the eigenvalues and eigenvec- 


tors of the coefficient matrix. 
(b) Find functions yi, y2 of the form 


where 


Yi = 117%1 + 12X22 


Y2 = A21%1 + A22%2, 


so that the differential equation for 


is decoupled. 
(c) If we write the differential equation 
for y as 


y=By, 
find the matrix B. 
(d) Solve the differential equation for y. 
(e) Use the solution y to find the so- 
lution x of the original differential 
equation. Write the solution as 
a(t) = c1 a(t) + co a(t), 


and give explicit expressions for 
a (t) and a?) (t). 
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5.3. Two-Dimensional Homogeneous Systems 


2 x 2 linear systems are important not only by themselves but as approximations of 
more complicated nonlinear systems. They are important by themselves because 2 x 2 
systems are simple enough so their solutions can be computed and classified. But they 
are non-trivial enough so their solutions describe several situations including exponential 
decays and oscillations. In this Section we study 2 x 2 systems in detail and we classify them 
according the eigenvalues of the coefficient matrix. In a later Chapter we will use them as 
approximations of more complicated systems. 


5.3.1. Diagonalizable Systems. Consider a 2 x 2 constant coefficient, homogeneous 
linear differential system, 


a Ag, PX be “a 
G21 A22 
where we assume that all matrix coefficents are real constants. The characteristic polynomial 
of the coefficient matrix is p(A) = det(A — AI). This is a polynomial degree two with real 
coefficients. Hence it may have two distinct roots-real or complex—or one repeated real 
root. In the case that the roots are distinct the coefficient matrix is diagonalizable, see 
Chapter 8. In the case that the root is repeated, the coefficient matrix may or may not be 
diagonalizable. Theorem 5.2.2 holds for a diagonalizable 2 x 2 coefficient matrix and state 
it below in the notation we use for 2 x 2 systems. 


Theorem 5.3.1 (Diagonalizable Systems). If the 2 x 2 constant matrix A is diagonalizable 
with eigenpairs \, v\+), then the general solution of a! = Ax is 


Pron (t) = c, e**# oO) + 6 ert yO), (5.3.1) 


We classify the 2 x 2 linear systems by the eigenvalues of their coefficient matrix: 
(i) The eigenvalues \,, A- are real and distinct; 
(it) The eigenvalues X= = a + {7 are distinct and complex, with \, = -; 
(itt) The eigenvalues \, = A. = Ao is repeated and real. 


We now provide a few examples of systems on each of the cases above, starting with an 
example of case (i). 


Example 5.3.1. Find the general solution of the 2 x 2 linear system 


1 3 
a — Aa, A=; ifs 


Solution: We have computed in Example 8.3.4 the eigenpairs of the coefficient matrix, 


Ay = 4, v(t], and A. =—2, eet aa 


This coefficient matrix has distinct real eigenvalues, so the general solution to the differential 
equation is 
1 _o |—-1 
Been (t) = o H +ce* | dl . 
dq 


We now focus on case (ii). The coefficient matrix is real-valued with the complex-valued 
eigenvalues. In this case each eigenvalue is the complex conjugate of the other. A similar 
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result is true for nxn real-valued matrices. When such nxn matrix has a complex eigenvalue 
A, there is another eigenvalue \. A similar result holds for the respective eigenvectors. 


Theorem 5.3.2 (Conjugate Pairs). Ifannxn real-valued matrix A has a complex eigenpair 
A, v, then the complex conjugate pair X, U0 is also an eigenpair of matrix A. 


Proof of Theorem 5.3.2: Complex conjugate the eigenvalue eigenvector equation for A 
and v, and recall that matrix A is real-valued, hence A = A. We obtain, 


Av=Xv AD= XB, 


This establishes the Theorem. 

Complex eigenvalues of a matrix with real coefficients are always complex conjugate 
pairs. Same it’s true for their respective eigenvectors. So they can be written in terms of 
their real and imaginary parts as follows, 


Az = ati, v*) =atib, (5.3.2) 


where a, 6 € R and a, bE R”. 

The general solution formula in Eq. (5.3.1) still holds in the case that A has complex 
eigenvalues and eigenvectors. The main drawback of this formula is similar to what we 
found in Chapter 2. It is difficult to separate real-valued from complex-valued solutions. 
The fix to that problem is also similar to the one found in Chapter 2: Find a real-valued 
fundamental set of solutions. The following result holds for n x n systems. 


Theorem 5.3.3 (Complex and Real Solutions). If A+ = a+iG are eigenvalues of ann xn 
constant matric A with eigenvectors v+) = a+ib, where a, 8 € R and a, b € R”, and 
n > 2, then a linearly independent set of two complex-valued solutions to a = Ax is 


{ x) (t) =e yy, g(t) =, 3 (5.3.3) 
Furthermore, a linearly independent set of two real-valued solutions to a! = Az is given by 


{a (t) = (a cos(Bt) — b sin(Bt)) e®*, 2 (t) = (a sin(Bt) + b cos(Bt)) e*}. (5.3.4) 


Proof of Theorem 5.3.3: Theorem 8.3.9 implies the set in (5.3.3) is a linearly independent 
set. The new information in Theorem 5.3.3 above is the real-valued solutions in Eq. (5.3.4). 
They are obtained from Eq. (5.3.3) as follows: 


git) = (at ib) (ot 

at(g + ib) eti6t 

“(a+ ib) (cos(St) + i sin(ft)) 

= e*'(a cos(ft) — b sin(t)) + ie (a sin(Bt) + b cos(Gt)). 


Since the differential equation a’ = Az is linear, the functions below are also solutions, 


=e 


=e 


al) = 5 (a + 2°) = (a cos(6t) — b sin(t)) 


ge) = (2° —a) = (asin(Gt) + b cos(Bt)) e™. 


This establishes the Theorem. 


Example 5.3.2. Find a real-valued set of fundamental solutions to the differential equation 


, _ | 2 3 
av = Ag, A= Ee : (5.3.5) 
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Solution: Fist find the eigenvalues of matrix A above, 


(2—) 3 | 


3. (2-2) =(A-2)? +9 Ap SPS Si, 


0=| 


Then find the respective eigenvectors. The one corresponding to A+ is the solution of the 
homogeneous linear system with coefficients given by 


P- a = a os | ~ Ee | ~ E ‘| ~ | ‘ | 7 F 4 


Therefore the eigenvector vu = ° is given by 
2 


Vv; = —iv; v,=1, vf =-4, v | » A= 24+ 3%. 
The second eigenvector is the complex conjugate of the eigenvector found above, that is, 


v= He A. = 2— 31. 


=f). 


Then, the real and imaginary parts of the eigenvalues and of the eigenvectors are given by 


a= 2, B=3, a=(1), b= |]. 


So a real-valued expression for a fundamental set of solutions is given by 


g = ({7| cos(3t) — Fo sin(3t)) et => g= bee a, 


af = ([f]sincay + [9] estan) > = |S) 


Notice that 


J 


We end with case (iii). There are no many possibilities left for a 2 x 2 real matrix that 
both is diagonalizable and has a repeated eigenvalue. Such matrix must be proportional to 
the identity matrix. 


Theorem 5.3.4. Every 2x2 diagonalizable matrix with repeated eigenvalue Xo has the form 
A= Agi. 
Proof of Theorem 5.3.4: Since matrix A diagonalizable, there exists a matrix P invertible 


such that A= PDP7!. Since A is 2 x 2 with a repeated eigenvalue Ao, then 


Put these two fatcs together, 
A= PAI? SGP PS el 
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Remark: The general solution @gey for xz’ = I xis simple to write. Since any non-zero 
2-vector is an eigenvector of A,/,, we choos the linearly independent set 


Using these eigenvectors we can write the general solution, 


Geaalt) Sey erot oft) +4 ©, erot yl) = c, erot a Te ad | = Gel) = ert a . 
2 


5.3.2. Non-Diagonalizable Systems. A 2 x 2 linear systems might not be diagonal- 
izable. This can happen only when the coefficient matrix has a repeated eigenvalue and all 
eigenvectors are proportional to each other. If we denote by the repeated eigenvalue of 
a 2 x 2 matrix A, and by v an associated eigenvector, then one solution to the differential 
system a = Avis 


a(t) = ev. 


Every other eigenvector v associated with 4 is proportional to v. So any solution of the form 
ve is proportional to the solution above. The next result provides a linearly independent 
set of two solutions to the system a’ = Az associated with the repeated eigenvalue 4. 


Theorem 5.3.5 (Repeated Eigenvalue). If an 2 x 2 matrix A has a repeated eigenvalue 
A with only one associated eigen-direction, given by the eigenvector v, then the differential 
system a/(t) = A a(t) has a linearly independent set of solutions 


f{ai(t)=e%v, a(t) =e (vt+w)}, 
where the vector w is one of infinitely many solutions of the algebraic linear system 
(A-—AlI)w= v. (5.3.6) 
Remark: The eigenvalue X is the precise number that makes matrix (A—AJ) not invertible, 
that is, det(A — AI) = 0. This implies that an algebraic linear system with coefficient 
matrix (A — AJ) is not consistent for every source. Nevertheless, the Theorem above says 


that Eq. (5.3.6) has solutions. The fact that the source vector in that equation is v, an 
eigenvector of A, is crucial to show that this system is consistent. 


Proof of Theorem 5.3.5: One solution to the differential system is 2 (t) = e* v. Inspired 
by the reduction order method we look for a second solution of the form 


a) (t) = e* u(t). 
Inserting this function into the differential equation a’ = A x we get 
wt+tdAu=Au => (A-Alju=v. 
We now introduce a power series expansion of the vector-valued function u, 
= 2 
u(t) = Up + ut+ ut? +--+, 
into the differential equation above, 


(A —XI)(Uo + uit + ut? +--+) = (um, + 2ut+---). 
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If we evaluate the equation above at ¢t = 0, and then its derivative at t = 0, and so on, we 
get the following infinite set of linear algebraic equations 


(A — AI) = th, 
(A — AI) = 2tn, 
(A — AI) u, = 34; 


Here is where we use Cayley-Hamilton’s Theorem. Recall that the characteristic polynomial 
p(A) = det(A — XT) has the form 


p(A) = A? — tr (A) \ + det(A). 
Cayley-Hamilton Theorem says that the matrix-valued polynomial p(A) = 0, that is, 
A? — tr(A) A+ det(A) I =0. 
Since in the case we are interested in matrix A has a repeated root A, then 
pA) = (A -—AyP = — BAA +A. 
Therefore, Cayley-Hamilton Theorem for the matrix in this Theorem has the form 
0=A?-2\A4+NI => (A-XdXI?=0. 


This last equation is the one we need to solve the system for the vector-valued u. Multiply 
the first equation in the system by (A — AJ) and use that (A — AJ)? = 0, then we get 


0=(A-AI?w =(A-A)uy > (A-ADuy =O. 


This implies that uw, is an eigenvector of A with eigenvalue \. We can denote it as uw, = v. 
Using this information in the rest of the system we get 


(A — AI) = v, 
(A — AI)v = 2u Uw = O, 
(A— AI)u, = 3us us = 0, 
We conclude that all terms uw, = u; =--: = O. Denoting uw = w we obtain the following 
system of algebraic equations, 
(A— AI)w = , 
(A-—AI)v= 0. 


For vectors v and w solution of the system above we get u(t) = w+tv. This means that 
the second solution to the differential equation is 


a) (t) =e (tut w). 


This establishes the Theorem. 


Example 5.3.3. Find the fundamental solutions of the differential equation 


1|]-6 4 
/ — 
a = Aa, A=] z 4): 
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Solution: As usual, we start finding the eigenvalues and eigenvectors of matrix A. The 
former are the solutions of the characteristic equation 


(-3 -)) 1 3 1, 1 

= 2 =) 4 =\+42\41= 1" 

0 | (i QQ+5)(A4+5) +7 =¥ +2 (+1) 
Therefore, there solution is the repeated eigenvalue \ = —1. The associated eigenvectors 


are the vectors v solution to the linear system (A + I)v = 0, 


ee | ~ 73 i - ki | Se fF Be > vy, =2u. 


Choosing v, = 1, then v, = 2, and we obtain 


A=-1, v=[i]. 


Any other eigenvector associated to A = —1 is proportional to the eigenvector above. The 
matrix A above is not diagonalizable. So. we follow Theorem 5.3.5 and we solve for a vector 
w the linear system (A + I)w = v. The augmented matrix for this system is given by, 
-$ 1 2} jl —2 —4) jl 2 —4 
-4 4 1 1 -2 —4 0 60 0 
We have obtained infinitely many solutions given by 


aieacl 


As one could have imagined, given any solution w, the cv+ w is also a solution for any 
c€R. We choose the simplest solution given by 


w= al | 


Therefore, a fundamental set of solutions to the differential equation above is formed by 


a(t) =e7* | , (thse (t | ‘i a (5.3.7) 


J 


| => w,= 2uw,—-4. 
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5.4. Two-Dimensional Phase Portraits 


Figures are easier to understand than words. Words are easier to understand than 
equations. The qualitative behavior of a function is often simpler to visualize from a graph 
than from an explicit or implicit expression of the function. 


Take, for example, the differential equation 


y(t) = sin(y(t)). 
This equation is separable and the solution can 
be obtained using the techniques in Section 1.3. 
They lead to the following implicit expression for 
the solution y, 


—In|ese(y) + cot(y)| =t+e. 


Although this is an exact expression for the so- 
lution of the differential equation, the qualitative 
behavior of the solution is not so simple to un- 
derstand from this formula. The graph of the so- 
lution, however, given on the right, provides us 
with a clear picture of the solution behavior. In 
this particular case the graph of the solution can 
be computed from the equation itself, without the 
need to solve the equation. 


FIGURE 1. Several so- 
lutions of the equation 


y' = sin(y) 


In the case of 2 x 2 systems the solution vector has the form 

«,(t) 
x(t) = | : 
Two functions define the solution vector. In this case one usually graphs each component 
of the solution vector, x, and 22, as functions of t. There is, however, another way to graph 
a 2-vector-valued function: plot the whole vector a(t) at t on the plane x,,22. Each vector 
x(t) is represented by its end point, while the whole solution x represents a line with arrows 
pointing in the direction of increasing t. Such a figure is called a phase diagram or phase 
portrait. 

In the case that the solution vector x(t) is interpreted as the position function of a 
particle moving in a plane at the time t, the curve given in the phase portrait is the trajectory 
of the particle. The arrows added to this trajectory indicate the motion of the particle as 
time increases. 


In this Section we say how to plot phase portraits. We focus on solutions to the systems 
studied in the previous Section 5.3-2 x 2 homogeneous, constant coefficient linear systems 


a(t) = Aa(t). (5.4.1) 


Theorem 5.3.1 spells out the general solution in the case the coefficient matrix is diagonal- 
izable with eigenpairs A+, v~. The general solution is given by 


Bgen(t) = c,v°e*? + eve? (5.4.2) 


Solutions with real distinct eigenvalues are essentially different from solutions with complex 
eigenvalues. Those differences can be seen in their phase portraits. Both solution types are 
essentially different from solutions with a repeated eigenvalue. We now study each case. 
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5.4.1. Real Distinct Eigenvalues. We study the system in (5.4.1) in the case that 
matrix A has two real eigenvalues \, 4 A.. The case where one eigenvalues vanishes is left 
one of the exercises at the end of the Section. We study the case where both eigenvalues 
are non-zero. Two non-zero eigenvalues belong to one of hte following cases: 


(i) A. > A. > 0, both eigenvalues positive; 
(ti) 4, > 0 > A_, one eigenvalue negative and the other positive; 
(itt) O> A, > A, both eigenvalues negative. 


In a phase portrait the solution vector a(t) at t is displayed on the plane x,,x,. The 
whole vector is shown, only the end point of the vector is shown for t € (—oo,0o). The 
result is a curve in the 2,,x2 plane. One usually adds arrows to determine the direction of 
increasing t. A phase portrait contains several curves, each one corresponding to a solution 
given in Eq. (5.4.2) for particular choice of constants c, and c_.. A phase diagram can be 
sketched by following these few steps: 


(a) Plot the eigenvectors v* and v corresponding to the eigenvalues A, and 2.. 

(b) Draw the whole lines parallel to these vectors and passing through the origin. These 
straight lines correspond to solutions with either c, or c_ zero. 

(c) Draw arrows on these lines to indicate how the solution changes as the variable ¢ in- 
creases. If t is interpreted as time, the arrows indicate how the solution changes into 
the future. The arrows point towards the origin if the corresponding eigenvalue X is 
negative, and they point away form the origin if the eigenvalue is positive. 

(d) Find the non-straight curves correspond to solutions with both coefficient c, and c_ 
non-zero. Again, arrows on these curves indicate the how the solution moves into the 
future. 


Case \, > A. > 0. 


Example 5.4.1. Sketch the phase diagram of the solutions to the differential equation 


a thi 3 
xv = Ag, A=7 1 ol: (5.4.3) 


Solution: The characteristic equation for this matrix A is given by 
A. = 3, 


det(A— AT) =)? -—5A+6=0 = . - 


One can show that the corresponding eigenvectors are given by 


ef. *-E4) 


So the general solution to the differential equation above is given by 


at)=ave t+evey' o axt)h=ag Hl te EB 


In Fig. 2 we have sketched four curves, each representing a solution x corresponding to a 
particular choice of the constants c, and c_. These curves actually represent eight different 
solutions, for eight different choices of the constants c, and c_, as is described below. The 
arrows on these curves represent the change in the solution as the variable t grows. Since 
both eigenvalues are positive, the length of the solution vector always increases as t increases. 
The straight lines correspond to the following four solutions: 


Gg=1,¢c=0; a=0,c=1;5 a=-1lc=0; a=0,c=-1. 
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The curved lines on each quadrant correspond to the following four solutions: 


G=le=l a=1le=-1 «@=-le=l a=-l,c=-1. 


FIGURE 2. Eight solutions to Eq. (5.4.3), where A, > A- > 0. The trivial 
solution z= 0 is called an unstable point. 


dq 
Case \, >0> 2.. 
Example 5.4.2. Sketch the phase diagram of the solutions to the differential equation 
, _ {1 3 
x = Ag, A= I 4 : (5.4.4) 


Solution: In Example ?? we computed the eigenvalues and eigenvectors of the coefficient 
matrix, and the result was 


Y= 4, V= Hl and A. =-2, v= | : 


In that Example we also computed the general solution to the differential equation above, 


at)=avetticre 2 ath=aq H eo te i a 

In Fig. 3 we have sketched four curves, each representing a solution x corresponding to a 
particular choice of the constants c, and c_. These curves actually represent eight different 
solutions, for eight different choices of the constants c, and c_, as is described below. The 
arrows on these curves represent the change in the solution as the variable t grows. The 
part of the solution with positive eigenvalue increases exponentially when ¢ grows, while the 
part of the solution with negative eigenvalue decreases exponentially when t grows. The 
straight lines correspond to the following four solutions: 


Gg=1,c=0; a=0,ce=1;5 a=-lc=0; a=0,c=-1. 
The curved lines on each quadrant correspond to the following four solutions: 


G=le=1l a=1le=-1 a@=-le=1l a=-1,c=-1. 
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FIGURE 3. Several solutions to Eq. (5.4.4), A. > 0 > A... The trivial 
solution «= 0 is called a saddle point. 


Case 0 >. >. 


Example 5.4.3. Sketch the phase diagram of the solutions to the differential equation 


1 |-9 3 
Solution: The characteristic equation for this matrix A is given by 
A = -2, 


det(A—AI)=N7+5A4+6=0 => oe 


One can show that the corresponding eigenvectors are given by 


ee 3 es —2 
~ 41}? ~ | 2° 
So the general solution to the differential equation above is given by 


at)=ave*tievey' ©& axth=ag Hl ete EB ere, 

In Fig. 4 we have sketched four curves, each representing a solution x corresponding to a 
particular choice of the constants c, and c_. These curves actually represent eight differ- 
ent solutions, for eight different choices of the constants c, and c_, as is described below. 
The arrows on these curves represent the change in the solution as the variable t grows. 
Since both eigenvalues are negative, the length of the solution vector always decreases as t 
grows and the solution vector always approaches zero. The straight lines correspond to the 
following four solutions: 


G=1,6=—0; -¢=—0,;c=21; GS -le=0; c¢=0,¢ =I. 
The curved lines on each quadrant correspond to the following four solutions: 


G=le=1l a=1le=-1 a@=-le=l a=-l1c=-1. 
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eal, e.=1 


ee ==1,e=—f 


FIGURE 4. Several solutions to Eq. (5.4.5), where 0 > A, > A-. The trivial 
solution x= 0 is called a stable point. 


5.4.2. Complex Eigenvalues. A real-valued matrix may have complex-valued eigen- 
values. These complex eigenvalues come in pairs, because the matrix is real-valued. If 
\ is one of these complex eigenvalues, then \ is also an eigenvalue. A usual notation is 
Ast = a+if, with a, 8 € R. The same happens with their eigenvectors, which are written 


as v= = a+ib, with a, b € R”, in the case of an n x n matrix. When the matrix is the 
coefficient matrix of a differential equation, 


a — Ag, 
the solutions z*(t) = v'e**! and a(t) = ve** are complex-valued. In the previous Sec- 
tion we presented Theorem 5.3.3, which provided real-valued solutions for the differential 


equation. They are the real part and the imaginary part of the solution a*, given by 
z'(t) = (a cos(St) — b sin(6t)) e*, z(t) = (asin(St) + b cos(Bt)) e*’. (5.4.6) 


These real-valued solutions are used to draw phase portraits. We start with an example. 


A-t 


Example 5.4.4. Find a real-valued set of fundamental solutions to the differential equation 
below and sketch a phase portrait, where 


2 3 
-ae[ 3 


Solution: We have found in Example 5.3.2 that the eigenvalues and eigenvectors of the 
coefficient matrix are 


a’ = Az 


Mg =24+3, vt= | 


Writing them in real and imaginary parts, A, =a+i8 and v= = a+ 1b, we get 


a= 2, B=3, o=(1]. b=| ol. 


These eigenvalues and eigenvectors imply the following real-valued fundamental solutions, 


{210 = [SE] &, en = [se] e*} an 
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The phase diagram of these two fundamental solutions is given in Fig. 5 below. There is 
also a circle given in that diagram, corresponding to the trajectory of the vectors 


~1/,, _ | sin(3t) ~2/,, _ |—cos(3t) 

=e) baw oe sin(3t) | 
The phase portrait of these functions is a circle, since they are unit vector-valued functions— 
they have length one. <J 


FIGURE 5. The graph of the fundamental solutions #) and #) in Eq. (5.4.7). 


Suppose that the coefficient matris of a 2 x 2 differential equation 2’ = A x has complex 
eigenvalues and eigenvectors 


At =atif, vu =axtib. 


We have said that real-valued fundamental solutions are given by 
x'(t) = (a cos(St) — b sin(Bt)) e**, a(t) = (a sin(St) + b cos(Bt)) e**. 


We now sketch phase portraits of these solutions for a few choices of a, a and b. We start 
fixing the vectors a, b and plotting phase diagrams for solutions having a > 0, a = 0, and 
a <0. The result can be seen in Fig. 6. For a > 0 the solutions spiral outward as t increases, 
and for a < 0 the solutions spiral inwards to the origin as t increases. The rotation direction 
is from vector b towards vector a. The solution vector 0, is called unstable for a > 0 and 
stable for a < 0. 

We now change the direction of vector b, and we repeat the three phase portraits given 
above; for a > 0, a= 0, and a < 0. The result is given in Fig. 7. Comparing Figs. 6 and 7 
shows that the relative directions of the vectors a and b determines the rotation direction 
of the solutions as ¢ increases. 
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FIGURE 6. Fundamental solutions z* and 2? in Eq. (5.4.6) fora > 0,a = 0, 
and a < 0. The relative positions of a and 6b determines the rotation 
direction. Compare with Fig. 7. 


FIGURE 7. Fundamental solutions z and a? in Eq. (5.4.6) fora > 0, a =0, 
and a < 0. The relative positions of a and b determines the rotation 
direction. Compare with Fig. 6. 


5.4.3. Repeated Eigenvalues. A matrix with repeated eigenvalues may or may not 
be diagonalizable. If a 2 x 2 matrix A is diagonalizable with repeated eigenvalues, then 
by Theorem 5.3.4 this matrix is proportional to the identity matrix, A = AI, with A> the 
repeated eigenvalue. We saw in Section 5.3 that the general solution of a differential system 
with such coefficient matrix is 


ttgen(t) = | ont 


Phase portraits of these solutions are just straight lines, starting from the origin for A) > 0, 
or ending at the origin for Ao < 0. 

Non-diagonalizable 2 x 2 differential systems are more interesting. If a = Axis sucha 
system, it has fundamental solutions 


z'(t) = ver, a(t) = (vt + w) er, (5.4.8) 


where A, is the repeated eigenvalue of A with eigenvector v, and vector w is any solution of 
the linear algebraic system 


(A — AoL)w= v. 
The phase portrait of these fundamental solutions is given in Fig 8. To construct this 


figure start drawing the vectors v and w. The solution z* is simpler to draw than 2’, since 
the former is a straight semi-line starting at the origin and parallel to v. 
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FIGURE 8. Functions a‘, a in Eq. (5.4.8) for the cases Ay > 0 and Ay < 0. 


The solution 2? is more difficult to draw. One way is to first draw the trajectory of the 

time-dependent vector 
x =vtt+w. 

This is a straight line parallel to v passing through w, one of the black dashed lines in 
Fig. 8, the one passing through w. The solution 2 differs from Z* by the multiplicative 
factor e*°*. Consider the case \y > 0. For t > 0 we have 2*(t) > 2°(t), and the opposite 
happens for t < 0. In the limit t + —co the solution values z(t) approach the origin, since 
the exponential factor e%°' decreases faster than the linear factor t increases. The result is 
the purple line in the first picture of Fig. 8. The other picture, for Ay < 0 can be constructed 
following similar ideas. 
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5.4.4. Exercises. 


5.4.1.- . 5.4.2.- . 


CHAPTER 6 


Autonomous Systems and Stability 


By the end of the seventeenth century Newton had invented differential equations, discov- 
ered his laws of motion and the law of universal gravitation. He combined all of them to 
explain Kepler laws of planetary motion. Newton solved what now is called the two-body 
problem. Kepler laws correspond to the case of one planet orbiting the Sun. People then 
started to study the three-body problem. For example the movement of Earth, Moon, and 
Sun. This problem turned out to be far more difficult than the two-body problem and no 
solution was ever found. Around the end of the nineteenth century Henri Poincaré proved 
a breakthrough result. The solutions of the three body problem could not be found explic- 
itly in terms of elementary functions, such as combinations of polynomials, trigonometric 
functions, exponential, and logarithms. This led him to invent the so-called Qualitative 
Theory of Differential Equations. In this theory one studies the geometric properties of 
solutions-whether they show periodic behavior, tend to fixed points, tend to infinity, etc. 
This approach evolved into the modern field of Dynamics. In this chapter we introduce a 
few basic concepts and we use them to find qualitative information of a particular type of 
differential equations, called autonomous equations. 
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6.1. Flows on the Line 


This chapter is dedicated to study the qualitative behavior of solutions to differential equa- 
tions without actually computing the explicit expression of these solutions. In this section 
we focus on first order differential equations in one unknown function. We already have 
studied these equations in Chapter 1, § 1.1-1.4, and we have found formulas for their solu- 
tions. In this section we use these equations to present a new method to study qualitative 
properties of their solutions. Knowing the exact solution to the equation will help us un- 
derstand how this new method works. In the next section we generalize the ideas in this 
section to 2 x 2 systems of nonlinear differential equations. 


6.1.1. Autonomous Equations. Let us study, one more time, first order nonlinear 
differential equations. In § 1.3 we learned how to solve separable equations—we integrated 
on both sides of the equation. We got an implicit expression for the solution in terms of 
the antiderivative of the equation coefficients. In this section we concentrate on a particular 
type of separable equations, called autonomous, where the independent variable does not 
appear explicitly in the equation. For these systems we find a few qualitative properties 
of their solutions without actually computing the solution. We find these properties of the 
solutions by studying the equation itself. 


Definition 6.1.1. A first order autonomous differential equation is 
y = f(y), (6.1.1) 


where y! = and the function f does not depend explictly on t. 


dy 
dt’ 
Remarks: The equation in (6.1.1) is a particular case of a separable equation where the 
independent variable t does not appear in the equation. This is the case, since Eq. (6.1.1) 
has the form 


h(y) y = g(t), 
as in Def. 1.3.1, with h(y) = 1/f(y) and g(t) = 1. 


The autonomous equations we study in this section are a particular type of the separable 
equations we studied in § 1.3, as we can see in the following examples. 
Example 6.1.1. The following first order separable equations are autonomous: 
(a) y' =2y+3. 
(b) y/ = sin(y). 
y 
=rylia +). 
() y=ry-Z 
The independent variable t does not appear explicitly in these equations. The following 
equations are not autonomous. 
(a) y! =2y4+3t. 
(b) y’ = t? sin(y). 
¥y 
= (1 “ +). 
(c) yaty K <l 


Remark: Since the autonomous equation in (6.1.1) is a particular case of the separable 
equations from § 1.3, the Picard-Lindel6f Theorem applies to autonomous equations. There- 
fore, the initial value problem y’ = f(y), y(0) = yo, with f continuous, always has a unique 
solution in the neighborhood of t = 0 for every value of the initial data yo. 
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Sometimes an autonomous equation can be solved explicitly, with solutions simple to 
graph and simple to understand. Here is a well known example. 
Example 6.1.2. Find all solutions of the first order autonomous system 


y =ay+t+b, a,b>0. 


Solution: 


This is a linear, constant coefficients equation, 
so it could be solved using the integrating fac- 
tor method. But this is also a separable equa- 
tion, so we solve it as follows, 


dy 1 
=/dt=> -1 b) =t 
ay +b / a n(ay + b) + Co 
so we get, 
ay+b=e%e% 
and denoting c = e* /a, we get the expression 
b 
y(t) =ce™ — -. (6.1.2) 
a 
This is the expression for the solution we got FicurE 1. A few solutions 
in Theorem 1.1.2. J to Eq. (6.1.2) for different c. 


However, sometimes it is not so simple to grasp the qualitative behavior of solutions of 
an autonomous equation. Even in the case that we can solve the differential equation. 


Example 6.1.3. Sketch a qualitative graph of solutions to y’ = sin(y), for different initial 
data conditions y(0) = yo. 


Solution: We first find the exact solutions and then we see if we can graph them. The 
equation is separable, then 


yi) ty) 
CC) a Lao aa 


Use the usual substitution u = y(t), so du = y’(t) dt, so we get 


i du ae 
yo sin(u) = 


In an integration table we can find that 


nf a => In| 


sin(y) in| sin (Yo) = 
1 + cos(y) 1 + cos(Yo) , 
We can rewrite the expression above in terms of one single logarithm, 
pj a el) 
(1 + cos(y)) sin(Yo) : 


If we compute the exponential on both sides of the equation above we get an implicit 
expression of the solution, 


Yo 


sin(y)  __sin(Yo) ab. (6.1.3) 


(1 + cos(y)) (1 + cos(yo)) 


6.1. FLOWS ON THE LINE 281 


Although we have the solution, in this case in implicit form, it is not simple to graph that 
solution without the help of a computer. So, we do not sketch the graph right now. <J 


It is not so easy to see certain properties of the solution from the exact expression 
in (6.1.3). For example, what is the behavior of the solution values y(t) as t — co for 
an arbitrary initial condition y).? To be able to answer questions like this one, is that we 
introduce a new approach, a geometric approach. 


6.1.2. Geometrical Characterization of Stability. The idea is to obtain quali- 
tative information about solutions to an autonomous equation using the equation itself, 
without solving it. We now use the equation in Example 6.1.3 to show how this can be 
done. 


Example 6.1.4. Sketch a qualitative graph of solutions to y/ = sin(y), for different initial 
data conditions y(0). 


Solution: The differential equation has the form y’ = f(y), where f(y) = sin(y). The first 
step in the graphical approach is to graph the function f. 


=f f(y) = sin(y) 


FIGURE 2. Graph of the function f(y) = sin(y). 


The second step is to identify all the zeros of the function f. In this case, 


f(y) =sin(y) = 0 Yn=nn, where n=--- ,—2,—1,0,1,2,---. 


It is important to realize that these constants y, are solutions of the differential equation. 
On the one hand, they are constants, t-independent, so y/, = 0. On the other hand, these 
constants y, are zeros of f, hence f(yn) = 0. So yp are solutions of the differential equation 


0=y), = f(Yn) = 0. 
The constants y,, are called critical points, or fixed points. When the emphasis is on the fact 


that these constants define constant functions solutions of the differential equation, then 
they are called stationary solutions, or equilibrium solutions. 


vy =z f(y) = sin(y) 


FIGURE 3. Critical points and increase/decrease information added to Fig. 2. 


282 6. AUTONOMOUS SYSTEMS AND STABILITY 


The third step is to identify the regions on the line where f is positive, and where f 
is negative. These regions are bounded by the critical points. A solution y of y’ = f(y) is 
increasing for f(y) > 0, and it is decreasing for f(y) < 0. We indicate this behavior of the 
solution by drawing arrows on the horizontal axis. In an interval where f > 0 we write a 
right arrow, and in the intervals where f < 0 we write a left arrow, as shown in Fig. 3. 


There are two types of critical points in Fig. 3. The points y-, = —7, y, = 7, have 
arrows on both sides pointing to them. They are called attractors, or stable points, and 
are pictured with solid blue dots. The points y-. = —27, yo = 0, yo = 27, have arrows on 


both sides pointing away from them. They are called repellers, or unstable points, and are 
pictured with white dots. 

The fourth step is to find the regions where the curvature of a solution is concave up 
or concave down. That information is given by y” = (y’)’ = (f(y))' = f’(y)y’ = f(y) Fly): 
So, in the regions where f(y) f’(y) > 0 a solution is concave up (CU), and in the regions 
where f(y) f’(y) < 0a solution is concave down (CD). See Fig. 4. 


FIGURE 4. Concavity information on the solution y added to Fig. 3. 


This is all we need to sketch a qualitative graph of solutions to the differential equation. 
The last step is to collect all this information on a ty-plane. The horizontal axis above is 
now the vertical axis, and we now plot soltuions y of the differential equation. See Fig. 5. 

Fig. 5 contains the graph of several solutions y for different choices of initial data y(0). 
Stationary solutions are in blue, t-dependent solutions in green. The stationary solutions 


are separated in two types. The stable solutions y_, = —7, y, = 7, are pictured with solid 
blue lines. The unstable solutions y-» = —27, yo = 0, yo = 27, are pictured with dashed 
blue lines. <J 


Remark: A qualitative graph of the solutions does not provide all the possible information 
about the solution. For example, we know from the graph above that for some initial 
conditions the corresponding solutions have inflection points at some t > 0. But we cannot 
know the exact value of t where the inflection point occurs. Such information could be 
useful to have, since |y’| has its maximum value at those points. 


In the Example 6.1.4 above we have used that the second derivative of the solution 
function is related to f and f’. This is a result that we remark here in its own statement. 


Theorem 6.1.2. If y is a solution of the autonomous system y’ = f(y), then 
vy" = f(y) £)- 


Remark: This result has been used to find out the curvature of the solution y of an 
autonomous system y’ = f(y). The graph of y has positive curvature iff f’(y) f(y) > 0 and 
negative curvature iff f’(y) f(y) < 0. 
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ID TU ge et a af a a fi a eh ea aaa Unstable 


T Stable 


Unstable 


—T Stable 


OT rr ccc cece Unstable 


FIGURE 5. Qualitative graphs of solutions y for different initial conditions. 


Proof: The differential equation relates y” to f(y) and f’(y), because of the chain rule, 


y= £(#) = “ (y(t)) = i ae y" = f'(y) f(y). 


6.1.3. Critical Points and Linearization. Let us summarize a few definitions we 
introduced in the Example 6.1.3 above. 


Definition 6.1.3. 
A point y. is a critical point of y' = f(y) iff f(y) =0. A critical points is: 


(i) an attractor (or sink), iff solutions flow toward the critical point; 
(it) a repeller (or source), iff solutions flow away from the critical point; 
(itt) neutral, iff solution flow towards the critical point from one side and flow away from 
the other side. 


In this section we keep the convention used in the Example 6.1.3, filled dots denote 
attractors, and white dots denote repellers. We will use a a half-filled point for neutral 
points. We recall that attractors have arrows directed to them on both sides, while repellers 
have arrows directed away from them on both sides. A neutral point would have an arrow 
pointing towards the critical point on one side and the an arrow pointing away from the 
critical point on the other side. We will usually mention critical points as stationary solutions 
when we describe them in a yt-plane, and we reserve the name critical point when we describe 
them in the phase line, the y-line. 

We also talked about stable and unstable solutions. Here is a precise definition. 
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Definition 6.1.4. Let yo be a a constant solution of y' = f(y), and let y be a solution with 
initial data y(0) = y,. The solution given by yo is stable iff given any € > 0 there is ad > 0 
such that if the initial data y, satisfies |y, — yo| < 6, then the solution values y(t) satisfy 
ly(t) — yo| < € for allt > 0. Furthermore, if limy+oo y(t) = Yo, then yo is asymptotically 
stable. If yo is not stable, we call it unstable. 


The geometrical method described in Example 6.1.3 above is useful to get a quick 
qualitative picture of solutions to an autonomous differential system. But it is always nice 
to complement geometric methods with analytic methods. For example, one would like an 
analytic way to determine the stability of a critical point. One would also like a quantitative 
measure of a solution decay rate to a stationary solution. A linear stability analysis can 
provide this type of information. 

We start assuming that the function f has a Taylor expansion at any yo. That is, 


F(y) = F(yo) + f’(Yo) (y — Yo) + O((y — Yo)”).- 
Denote fo = f(yo), then f} = f’(yo), and introduce the variable u = y — yo. Then we get 


fly) = fot fout o(u’). 
Let us use this Taylor expansion on the right hand side of the equation y’ = f(y), and 
recalling that y/ = (yo + u)’ =u’, we get 
yY=fy) & wW=fot foutolu’). 
If yo is a critical point of f, then fy = 0, then 
y=fy) @ w= fouto(u*). 

From the equations above we see that for y(t) close to a critical point y the right hand 
side of the equation y’ = f(y) is close to f{u. Therefore, one can get information about 


a solution of a nonlinear equation near a critical point by studying an appropriate linear 
equation. We give this linear equation a name. 


Definition 6.1.5. The linearization of an autonomous system y! = f(y) at a critical 
point y. is the linear differential equation for the function u given by 


ul = f' (Ye) u. 
Remark: The prime notation above means, u’ = du/dt, and f’ = df /dy. 


Example 6.1.5. Find the linearization of the equation y’ = sin(y) at the critical point 
Yn = na. Write the particular cases for n = 0,1 and solve the linear equations for arbitrary 
initial data. 


Solution: If we write the nonlinear system as y’ = f(y), then f(y) = sin(y). We then 
compute its y derivative, f’(y) = cos(y). We evaluate this expression at the critical points, 
f’ (Yn) = cos(nm) = (—1)". The linearization at y,, of the nonlinear equation above is the 
linear equation for the unknown function u,, given by 


u,, = (—1)" un. 
The particular cases n = 0 and n = 1 are given by 

if, = We U, = —U4. 
It is simple to find solutions to first order linear homogeneous equations with constant 
coefficients. The result, for each equation above, is 


uo(t) = uo(0) e*, ux(t) = ux(0) e~*. 
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As we see in the the Def. 6.1.5 and in Example 6.1.5, the linearization of y’ = f(y) at 
a critical point y, is quite simple, it is the linear equation u’ = au, where a = f’(y)). We 
know all the solutions to this linear equation, we computed them in § 1.1. 


Theorem 6.1.6 (Stability of Linear Equations). The constant coefficent linear equation 
u' =au, witha £0, has only one critical point up = 0. And the constant solution defined 
by this critical point is unstable for a > 0, and it is asymptotically stable for a < 0. 


Proof of Theorem 6.1.6: The critical points of the linear equation u’ = au are the 
solutions of au = 0. Since a 4 0, that means we have only one critical point, uy» = 0. Since 
the linear equation is so simple to solve, we can study the stability of the constant solution 
Uo = 0 from the formula for all the solutions of the equation, 


u(t) = u(0) e. 
The graph of all these solutions is sketch in Fig. 6. in the case that u(0) 4 0, we see that 


for a > 0 the solutions diverge to +00 as t > oo, and for a < 0 the solutions approach to 
zero as t + OO. 


U 


uo > 0 uo > 0 


Unstable Stable 


uo <0 uo <0 


FIGURE 6. The graph of the functions u(t) = u(0) e for a> 0 anda <0. 


Remark: In the Example 6.1.5 above (and later on in Example 6.1.8) we see that the 
stability of a critical point y. of a nonlinear differential equation y’ = f(y) is the same as 
the stability of the trivial solution u = 0 of the linearization u’ = f’(y.) u. This is a general 
result, which we state below. 


Theorem 6.1.7 (Stability of Nonlinear Equations). Let y. be a critical point of the 
autonomous system y’ = f(y). 


(a) The critical point y. is stable iff f'(yc) <9. 
(b) The critical point y. is unstable iff f’(yc) > 0. 


Furthermore, If the initial data y(0) ~ yc, is close enough to the critial point y., then the 
solution with that initial data of the equation y' = f(y) are close enough to y. in the sense 


y(t) = ye + ult), 


where u is the solution to the linearized equation at the critical point yc, 
ul = f"(ye)u, — u(0) = y(0) — Ye. 
Remark: The proof of this result can be found in § 2.4 in Strogatz textbook [12]. 


Remark: The first part of Theorem 6.1.7 highlights the importance of the sign fo the 
coefficient f’(y.), which determines the stability of the critical point y.. The furthermore 
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part of the Theorem highlights how stable is a critical point. The value |f’(y.)| plays 
a role of an exponential growth or a exponential decay rate. Its reciprocal, 1/|f’(y.)| is 
a characteristic scale. It determines the value of t required for the solution y to vary 
significantly in a neighborhood of the critical point ye. 


6.1.4. Population Growth Models. The simplest model for the population growth 
of an organism is N’ = rN where N(t) is the population at time ¢t and r > 0 is the growth 
rate. This model predicts exponential population growth N(t) = Noe", where No = N(0). 
This model assumes that the organisms have unlimited food supply, hence the per capita 
growth N’/N =r is constant. 

A more realistic model assumes that the per capita growth decreases linearly with N, 
starting with a positive value, 7, and going down to zero for a critical population N = K > 0. 
So when we consider the per capita growth N’/N as a function of N, it must be given by 
the formula N’/N = —(r/K)N +r. This is the logistic model for population growth. 


Definition 6.1.8. The logistic equation describes the organisms population function N 
in time as the solution of the autonomous differential equation 


N 
N'=rN(1-=), 
K 


where the initial growth rate constant r and the carrying capacity constant K are positive. 


Remark: The logistic equation is, of course, a separable equation, so it can be solved using 
the method from § 1.3. We solve it below, so you can compare the qualitative graphs from 
Example 6.1.7 with the exact solution below. 


Example 6.1.6. Find the exact expression for the solution to the logistic equation for 
population growth 


y=ry(1- 4), y(0) = Yo, ORG KG 


Solution: This is a separable equation, 


= y’ dt re 
— | ——_ = Co. 
r J (K=y)y : 


The usual substitution u = y(t), so du = y’ dt, implies 


=f du b+ aa ~/ zl 1 1 d pk 
ee ——— ’ t U = Co. 
r J (K-u)u a rj Kl(k-u) u : 


where we used partial fractions decomposistion to get the second equation. Now, each term 
can be integrated, 


[—In(|K — y|) + In(|y|)] = rt + reo. 
We reorder the terms on the right-hand side, 


intl) =r tre => at) = ce, c=e'®, 


The analysis done in Example 6.1.4 says that for initial data 0 < yo < K we can discard the 
absolute values in the expression above for the solution. Now the initial condition fixes the 
value of the constant c, 

Kyo 
Yo + (K — yo) ent 


Yo 
c y(t 
i, (t) 
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Remark: The expression above provides all solutions to the logistic equation with initial 
data on the interval (0, &). With some more work one could graph these solution and get 
a picture of the solution behaviour. We now use the graphical method discussed above to 
get a qualitative picture of the solution graphs without solving the differential equation. 


Example 6.1.7. Sketch a qualitative graph of solutions for different initial data conditions 
y(0) = yo to the logistic equation below, where r and K are given positive constants, 


- 1-2). 
y ry ( K 


Solution: 

The logistic differential equation for pop- 
ulation growth can be written y’ = f(y), 
where function f is the polynomial 


Yy 
Sepia 2) 
fy) =ry ( K 
The first step in the graphical approach 


is to graph the function f. The result is 
in Fig. 7. 


FIGURE 7. The graph of f. 


The second step is to identify all criti- 
cal points of the equation. The critical 
points are the zeros of the function f. In 
this case, f(y) = 0 implies 
Yo = 0, YW = K. 

The third step is to find out whether 
the critical points are stable or unstable. 
Where function f is positive, a solution 
will be increasing, and where function f 
is negative a solution will be decreasing. 
These regions are bounded by the crit- 
ical points. Now, in an interval where 
f > 0 write a right arrow, and in the 
intervals where f < 0 write a left arrow, 
as shown in Fig. 8. 


FIGURE 8. Critical points added. 


This is all the information we need to sketch a qualitative graph of solutions to the 
differential equation. So, the last step is to put all this information on a yt-plane. The 
horizontal axis above is now the vertical axis, and we now plot solutions y of the differential 
equation. The result is given in Fig. 10. 

The picture above contains the graph of several solutions y for different choices of initial 
data y(0). Stationary solutions are in blue, t-dependent solutions in green. The stationary 
solution yo = 0 is unstable and pictured with a dashed blue line. The stationary solution 
y, = K is stable and pictured with a solid blue line. <J 


Example 6.1.8. Find the linearization of the logistic equation y’ = ry (1 - 2) at the 


critical points yo = 0 and y, = K. Solve the linear equations for arbitrary initial data. 
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The fourth step is to find the regions 
where the curvature of a solution is con- 
cave up or concave down. That informa- 
tion is given by y”. But the differential 
equation relates y” to f(y) and f’(y). 
We have shown in Example 6.1.4 that 
the chain rule and the differential equa- 
tion imply, 


y” = f'(y) fy) 


So the regions where f(y) f/(y) > 0a 
solution is concave up (CU), and the re- 
gions where f(y) f’(y) < 0 a solution 
is concave down (CD). The result is in FIGURE 9. Concavity informa- 
Fig. 9. tion added. 


CU 


x 


Stable 


Unstable 


CD 


FIGURE 10. Qualitative graphs of solutions y for different initial conditions. 


Solution: If we write the nonlinear system as y’ = f(y), then f(y) = ry(1 — 2). The 


2r 
critical points are yo) = 0 and y, = K. We also need to compute f’(y) =r — ras For the 
critical point yo = 0 we get the linearized system 


u(t) =rug =>  Uo(t) = u(0)e”. 
For the critical point y, = K we get the linearized system 
u(t)=—ru => wu(th=u(Oe™. 


From this last expression we can see that for yo = 0 the critical solution uy = 0 is unstable, 
while for y,; = K the critical solution u, = 0 is stable. The stability of the trivial solution 
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Uo = U1 = 0 of the linearized system coincides with the stability of the critical points y = 0, 
y, = K for the nonlinear equation. <J 


Notes 
This section follows a few parts of Chapter 2 in Steven Strogatz’s book on Nonlinear 
Dynamics and Chaos, [12], and also § 2.5 in Boyce DiPrima classic textbook [3]. 


6.1.5. Exercises. 


6.1.1.- . 


6. AUTONOMOUS SYSTI 


EMS AND STABILITY 


6.1.2.- . 
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6.2. Flows on the Plane 


We now turn to study two-dimensional nonlinear autonomous systems. We start reviewing 
the critical points of two-by-two linear systems and classifying them as attractors, repellers, 
centers, and saddle points. We then introduce a few examples of two-by-two nonlinear sys- 
tems. We define the critical points of nonlinear systems. We then compute the linearization 
of these systems and we study the linear stability of these two-dimensional nonlinear sys- 
tems. In the next section we solve a few examples from biology (predator-prey systems 
and competing species systems), and from physics (the nonlinear pendulum and potential 
systems). 


6.2.1. Two-Dimensional Nonlinear Systems. We start with the definition of au- 
tonomous systems on the plane. 


Definition 6.2.1. A first order two-dimensional autonomous differential equation is 
a = f(z), 


d 
where af = oe and the vector field f does not depend explicitly on t. 


Remark: If we introduce the vector components a(t) = al and f(z) = Pee 
2 2 hg 2' 


then the autonomous equation above can be written in components, 
/ 
x, = Fils oe 
i 
©, = fo(x1, 22), 


ae 
where x), = = for i = 1, 2. 


d 


Example 6.2.1 (The Nonlinear Pendulum). 
A pendulum of mass m, length @, oscillating 
under the gravity acceleration g, moves ac- 
cording to Newton’s second law of motion 


m(€0)" = —mgsin(6), : 


where the angle 6 depends on time t. If we \ 
rearrange terms we get a second order scalar \ 


| 
| 
. 2 7 / 
equation ‘ en 
a” + £ sin(@) =0. Me ! # 
x. ” 
This scalar equation can be written as a non- Mig ae 
linear system. If we introduce 7, = 6 and ee alll 
Ly = 9’, then 
FIGURE 11. Pendulum. 
L, = Xp 
r= —~ sin(2). 
J 


Example 6.2.2 (Predator-Prey). The physical system consists of two biological species 
where one preys on the other. For example cats prey on mice, foxes prey on rabbits. If we 
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call x, the predator population, and x, the prey population, then predator-prey equations, 
also known as Lotka-Volterra equations for predator prey, are 


/ 
L, = —4%,4+ 524%, 


/ 
Ly = —CL1L. + dL. 


The constants a,b,c, d are all nonnegative. Notice that in the case of absence of predators, 
x, = 0, the prey population grows without bounds, since 24, = da. In the case of absence of 
prey, 2, = 0, the predator population becames extinct, since x = —ax,. The term —c2,2, 
represents the prey death rate due to predation, which is porportional to the number of 
encounters, 2,22, between predators and prey. These encounters have a positive contribution 
ba,x, to the predator population. <J 


Example 6.2.3 (Competing Species). The physical system consists of two species that 
compete on the same food resources. For example, rabbits and sheep, which compete on 
the grass on a particular piece of land. If x, and x, are the competing species popultions, 
the the differential equations, also called Lotka-Volterra equations for competition, are 


x4 

vL=ry 24 (1-2 -az,), 
wip) 

= Tp%. (1- 2 - gaz). 


The constants 71,72, a, 8 are all nonnegative, and K,, K2 are positive. Note that in the case 
of absence of one species, say x, = 0, the population of the other species, x, is described by 
a logistic equation. The terms —a 2,2, and —$ 2,22 say that the competition between the 
two species is proportional to the number of competitive pairs 7,2>. <J 


6.2.2. Review: The Stability of Linear Systems. In § ?? we used phase portraits 
to display vector functions 
x, (t) 
a(t) = ; 
( ) ie Hl 


solutions of 2 x 2 linear differential systems. In a phase portrait we plot the vector x(t) 
on the plane 2,7, for different values of the independent variable t.We then plot a curve 
representing all the end points of the vectors a(t), for t on some interval. The arrows in the 
curve show the direction of increasing t. 


v2 


x(t) 


v1 


FIGURE 12. A curve in a phase portrait represents all the end points of 
the vectors x(t), for t on some interval. The arrows in the curve show the 
direction of increasing t. 
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We saw in § ?? that the behavior of solutions to 2-dimensional linear systems depend 
on the eigenvalues of the coefficient matrix. If we denote a general 2 x 2 matrix by 


then the eigenvalues are the roots of the characteristic polynomial, 
det(A — AI) = #7 — pr+q=0, 
where we denoted p = aj, + agg and g = @41 092 — 412021. Then the eigenvalues are 


\. _pt/p-4g_p, VA 


2 2 2” 
where A = p” — 4g. We can classify the eigenvalues according to the sign of A. In Fig 13 
we plot on the pq-plane the curve A = 0, that is, the parabola g = p?/4. The region above 
this parabola is A < 0, therefore the matrix eigenvalues are complex, which corresponds 
to spirals in the phase portrait. The spirals are stable for p < 0 and unstable for p > 0. 
The region below the parabola corresponds to real disctinct eigenvalues. The parabola itself 
corresponds to the repeated eigenvalue case. 


q 
Asymp. stable, spiral point | Unstable, spiral point J 
(Ce) Stable aR 
SS center * © 
7 sg 2 
a. f & A=p*-4q=0 
% < 
& sy 
Asymp. stable = s Unstable 
oC A=p2-—4q <0 $ node 


Unstable, saddle point P 


y 
A=p2-4q>0 dX 
a 


FIGURE 13. The stability of the solution a = 0. (Boyce DiPrima, § 9.1, [3].) 


The trivial solution 2% = 0 is called a critical point of the linear system a = Ag. Here 
is a more detailed classification of this critical point. 


Definition 6.2.2. The critical point % = 0 of a2 x 2 linear system a = Ax is: 


(a) an attractor (or sink), iff both eigenvalues of A have negative real part; 
(b) a repeller (or source), iff both eigenvalues of A have positive real part; 
(c) a saddle, iff one eigenvalue of A is positive and the other is negative; 
(d) a center, iff both eigenvalues of A are pure imaginary; 

(e) higher order critical point iff at least one eigenvalue of A is zero. 


The critical point a = 0 is called hyperbolic iff it belongs to cases (a-c), that is, the 
real part of all eigenvalues of A are nonzero. 
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We saw in § ?? that the behavior of solutions to a linear system a = Aa, with initial 
data x(0), depends on what type of critical point is a = 0. The results presented in that 
section can be summarized in the following statement. 


Theorem 6.2.3 (Stability of Linear Systems). Let x(t) be the solution of a2 x 2 linear 
system a’ = Ax, with det(A) 4 0 and initial condition «(0) = ay. 


(a) The critical point a = 0 is an attractor iff for any initial condition 2(0) the correspond- 
ing solution x(t) satisfies that lim:+.. a(t) = 0. 

(b) The critical point a = 0 is a repeller iff for any initial condition «(0) the corresponding 
solution a(t) satisfies that lim:,.. |a(t)| = oo. 

(c) The critical point a = 0 is a center iff for any initial data x(0) the corresponding 
solution a(t) describes a closed periodic trajectory around 0. 


Phase portraits will be very useful to understand solutions to 2-dimensional nonlinear 
differential equations. We now state the main result about solutions to autonomous systems 
x = f(a) is the following. 


Theorem 6.2.4 (IVP). If the field f differentiable on some open connected set D € R?, 
then the initial value problem 


v=f(2), a0) =m ED, 
has a unique solution x(t) on some nonempty interval (—t,,t,) about t = 0. 


Remark: The fixed point argument used in the proof of Picard-Lindel6f’s Theorem 1.6.2 
can be extended to prove Theorem 6.2.4. This proof will be presented later on. 


Remark: That the field f is differentiable on D € R? means that f is continuous, and all 
the partial derivatives Of;/Ox,;, for 1,7 = 1,2, are continuous for all # in D. 


Theorem 6.2.4 has an important corollary: different trajectories never intersect. If two 
trajectories did intersect, then there would be two solutions starting from the same point, the 
crossing point. This would violate the uniqueness part of the theorem. Because trajectories 
cannot intersect, phase portraits of autonomous systems have a well-groomed appearence. 


6.2.3. Critical Points and Linearization. We now extended to two-dimensional 
systems the concept of linearization we introduced for one-dimensional systems. The hope 
is that solutions to nonlinear systems close to critical points behave in a similar way to 
solutions to the linearized system. We will see that this is the case if the linearized system 
has distinct eigenvalues. Se start with the definition of critical points. 


Definition 6.2.5. A critical point of a two-dimensional system af = f(a) is a vector 
where the field f vanishes, 


f(a) = 0. 


Remark: A critical point defines a constant vector function x(t) = 2 for all t, solution of 
the differential equation, 
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fi 


In components, the field is f = | f 
2 


0 
| , and the critical point a = i is solution of 
2 


When there is more than one critical point we will use the notation x;, with i = 0,1,2,---, 
to denote the critical points. 
Example 6.2.4. Find all the critical points of the two-dimensional (decoupled) system 


gz, = —a, + (a)? 


/ 
Ly = —229. 


Solution: We need to find all constant vectors «= a solutions of 
2 


—ai + (21)? =0, 
—2 Lg = 0. 


From the second equation we get x, = 0. From the first equation we get 


(Ua)? - 1) =0 3S «2,=0, or 2, =+1. 


eae : 0 il —1 
Therefore, we got three critical points, % = al a= a ,o= | | : <J 
We now generalize to two-dimensional systems the idea of linearization introduced in 
§ 6.1 for scalar equations. Consider the two-dimensional system 


oA = fi(@1, £2), 


wv, = fo(@1, £2), 


0 
Assume that f,, fo have Taylor expansions at % = ie . We denote u, = (a, — 2°) and 


2 
Up = (2-28), and fe = fi (ao, 28), f2 = fo(x?, x8). Then, by the Taylor expansion theorem, 


Of Of. 
files, 22) = f+ an, bis Ut aes a Ug 1 o(uz,uz), 
of. Of. 
tn, = — an, is uy 4 On, ue o(uz, uz). 
Let us simplify the notation a bit further. Let us denote 
_ Of, _ Oh 
Of; — Ox, ae Oo fs = Ox, a 
Ofo Ofs 
Or fo Ax, aa > fo Ax, es 


then the Taylor expansion of f has the form 


fi(@1, 22) = fp + (Afr) us + (Gof) Ue + o(uz, us), 
fo(a1,@2) = fp + (Or fe) Us + (Oofe) Ue + 0(u?, uz). 
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We now use this Taylor expansion of the field f into the differential equation a = f. Recall 
that 7, = 7? + u,, and 2, = 7} + us, and that x? and 2} are constants, then 

uy, = fp + (Orfi) us + (Oofr) ue + o(uz, uz), 

uy = fp + (Of) ur + (Oofo) Us + 0(uz, uz). 


Let us write this differential equation using vector notation. If we introduce the vectors and 
the matrix 
U1 ft Of, oft 
i 4 — 5 D = 5 
* to 4 Io be O2 fr 


e=f(z) u =f, +(Dfo) ut o(|u). 
In the case that 2 is a critical point, then f, = 0. In this case we have that 
g=f(z) @& w =(Dfy)uto(|ul’). 


The relation above says that the equation coefficients of a/ = f(x) are close, order 0(|uJ?), 
to the coefficients of the linear differential equation u’ = (Df) u. For this reason, we give 
this linear differential equation a name. 


then, we have that 


Definition 6.2.6. The linearization of a two-dimensional system a’ = f(x) at a critical 
point Lp is the 2 x 2 linear system 


ul = (D fo) U, 
where u = &— X, and we have introduced the Jacobian matrix at a, 
oft oft 
Dy, = oa Lo dx2 xo — On Oo fs . 
Ofe ofa O1fo Onfe 
Ox x0 0x2 x0 


Remark: In components, the nonlinear system is 
/ 
x, = fila, £2); 


cs = fo(X1, 2), 


U1 _ On fi; Oo fi U1 
U2} Of O2fo} [Ue] 


Example 6.2.5. Find the linearization at every critical point of the nonlinear system 


and the linearization at 2 is 


a = —z, + (x,)° 


ty = 2g. 
Solution: We found earlier that this system has three critial points, 


a=} -[]- -fi] 


This means we need to compute three linearizations, one for each critical point. We start 
computing the derivative matrix at an arbitrary point a, 


Of. oft a 
we cts Bay (222) Bap (222) 
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so we get that 
_ [-14+32? 0 
Soa a Sk 
We only need to evaluate this matrix Df at the critical points. We start with 2, 
_ fo _f-1 0 a) [st Oly 
w=[] = pa-[F 3] = [il=[o 3} fe 


The Jacobian at x, and a is the same, so we get the same linearization at these points, 
~ ie _f2 0 us] [2 0] [us 
a=[] > Pe-[ 2) > [el=b Sa) 
_ ie _ [2 0 us|) [2 07 [us 
e=[o] > Pa=[ S] > [ty =[ 5] |e 


Critical points of nonlinear systems are classified according to the eigenvalues of their 
corresponding linearization. 


J 


Definition 6.2.7. A critical point a% of a two-dimensional system a! = f(a) is: 


(a) an attractor (or sink), iff both eigenvalues of Df) have negative real part; 
(b) a repeller (or source), iff both eigenvalues of Df have positive real part; 
(c) a saddle, iff one eigenvalue of Df is positive and the other is negative; 
(d) a center, iff both eigenvalues of Df) are pure imaginary; 

(e) higher order critical point iff at least one eigenvalue of Dfo is zero. 


A critical point %% is called hyperbolic iff it belongs to cases (a-c), that is, the real part 
of all eigenvalues of Df) are nonzero. 


Example 6.2.6. Classify all the critical points of the nonlinear system 
x, = —2, + (a,)* 


/ 
Ly = —2 Xo. 


Solution: We already know that this system has three critical points, 


We have already computed the linearizations at these critical points too. 


phr=[y %|, pa=pa=(5 2. 


We now need to compute the eigenvalues of the Jacobian matrices above. For the critical 
point % we have A, = —1, A- = —2, so & is an attractor. For the critical points a, and a, 
we have , = 2, A- = —2, so a, and ga are saddle points. <J 


6.2.4. The Stability of Nonlinear Systems. Sometimes the stability of two-dimensional 
nonlinear systems at a critical point is determined by the stability of the linearization at 
that critical point. This happens when the critical point of the linearization is hyperbolic, 
that is, the Jacobian matrix has eigenvalues with nonzero real part. We summarize this 
result in the following statement. 
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Theorem 6.2.8 (Hartman-Grobman). Consider a two-dimensional nonlinear autono- 
mous system with a continuously differentiable field f , 


a = f(z), 
and consider its linearization at a hyperbolic critical point Xp, 
ul = (Dfp) u. 


Then, there is a neighborhood of the hyperbolic critical point a where all the solutions of 
the linear system can be transformed into solutions of the nonlinear system by a continuous, 
invertible, transformation. 


Remark: The Hartman-Grobman theorem implies that the phase portrait of the linear 
system in a neighborhood of a hyperbolic critical point can be transformed into the phase 
portrait of the nonlinear system by a continuous, invertible, transformation. When that 
happens we say that the two phase portraits are topologically equivalent. 


Remark: This theorem says that, for hyperbolic critical points, the phase portrait of the 
linearization at the critical point is enough to determine the phase portrait of the nonlinear 
system near that critical point. 


Example 6.2.7. Use the Hartman-Grobman theorem to sketch the phase portrait of 
az = —z, + (x,)° 


/ 
Lo = — 22. 


Solution: We have found before that the critical points are 


where a is an attractor and 2,, 2 are saddle points. 


The phase portrait of the linearized systems = 

at the critical points is given in Fig 6.2.4. = 

These critical points have all linearizations Mk ar 

with eigenvalues having nonzero real parts. a AN a a 
This means that the critical points are hyper- | 

bolic, so we can use the Hartman-Grobman i 


theorem. This theorem says that the phase 
portrait in Fig. 6.2.4 is precisely the phase 
portrait of the nonlinear system in this ex- 
ample. 


FIGURE 14. Phase portraits 
of the linear systems at 2, 
Z,, and a. 


Since we now know that Fig 6.2.4 is also the phase portrait of the nonlinear, we only 
need to fill in the gaps in that phase portrait. In this example, a decoupled system, we 
can complete the phase portrait from the symmetries of the solution. Indeed, in the z, 
direction all trajectories must decay to exponentially to the x. = 0 line. In the z, direction, 
all trajectories are attracted to x, = 0 and repelled from x, = +1. The vertical lines 7, = 0 
and #, = +1 are invariant, since x, = 0 on these lines; hence any trajectory that start on 
these lines stays on these lines. Similarly, x2 = 0 is an invariant horizontal line. We also 
note that the phase portrait must be symmetric in both x, and x2 axes, since the equations 
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are invariant under the transformations 7, > —az, and xz, + —2,. Putting all this extra 
information together we arrive to the phase portrait in Fig. 15. 


FIGURE 15. Phase portraits of the nonlinear systems in the Example 6.2.7 


<q 


6.2.5. Competing Species. Suppose we have two species competing for the same 
food resources. Can we predict what will happen to the species population over time? Is 
there an equilibrium situation where both species cohabit together? Or one of the species 
must become extinct? If this is the case, which one? 

We study in this section a particular competing species system, taken from Strogatz 
[12], 


v= 2,(3—2,—222), (6.2.1) 
x, = £2 (2-22-21), (6.2.2) 


where 2,(t) is the population of one of the species, say rabbits, and x2(t) is the population 
of the other species, say sheeps, at the time t. We restrict to nonnegative functions 7,, X2. 

We start finding all the critical points of the rabbits-sheeps system. We need to find all 
constants 71, 22 solutions of 


x, (3 —2,—22,) =0, (6.2.3) 
Ly (2— 22-21) = 0. (6.2.4) 
From Eq. (6.2.3) we get that one solution is z, = 0. In that case Eq. (6.2.4) says that 


U(2—2,)=0 => a, =0 or 2, =2. 


So we got two critical points, % = Ie and x, = | . We now consider the case that x, 4 0. 
In this case Eq. (6.2.3) implies 
(3-—a,-2a,)=0 3S a,=3-22y. 
Using this equation in Eq. (6.2.4) we get that 
%=0, hence 2x, =3, 


U(2—X,-3+22%,)=0 => -2,(-l+a,)=0 => or 
%=1, hence x,=1. 


So we got two more critical points, 7 = H and a = fe We now proceed to find 
the linearization of the rabbits-sheeps system in Eqs.(6.2.1)-(6.2.2). We first compute the 
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derivative of the field f, where 
—_ | fil — [a1 (8-21 — 222) 
F(z) a p 7 Lo (2 — a2 — 2) : 


The derivative of f at an arbitrary point z is 


of. Of. 
Df(a) = Bac Bis _ |(8— 2a, — 2a) —2 2, 
~ | fe Of2 | —2Xo (2—a,—2a,)|° 
0x4 0x2 


We now evaluate the matrix Df(a) at each of the critical points we found. 


0 3 0 
At H= A we get (Df) = i | : 
This coefficient matrix has eigenvalues A. = 3 and A o- = 2, both positive, which means that 
the critical point x is a repeller. To sketch the phase portrait we will need the corresponding 
eigenvectors, uj = aI and v, = A 


At q= | we get (Df,) = E 4 , 


This coefficient matrix has eigenvalues \,, = —1 and A, = —2, both negative, which means 
that the critical point x, is an attractor. One can check that the corresponding eigenvectors 


P 1 __ |0 
are U; = Ej and v, = Ae 


At m= Hl we get (Df,) = Bi e| 


This coefficient matrix has eigenvalues A, = —1 and A,- = —3, both negative, which means 
that the critical point x, is an attractor. One can check that the corresponding eigenvectors 


+_ {7-3 - 1 
are U, = | | and v, = al: 


At a= Hl we get (Df) = ir oI ; 


One can check that this coefficient matrix has eigenvalues \3, = —1+ V2 and \3. = —1— v2, 
which means that the critical point x3 is a saddle. One can check that the corresponding 


eigenvectors are Uv; = Re and v, = al We summarize this information about the 


linearized systems in the following picture. 

We would like to have the complete phase portrait for the nonlinear system, that is, we 
would like to fill the gaps in Fig. 17. This is difficult to do analytically in this example as 
well as in general nonlinear autonomous systems. At this point is where we need to turn to 
computer generated solutions to fill the gaps in Fig. 17. The result is in Fig. 18. 

We can now study the phase portrait in Fig. 18 to obtain some biological insight on 
the rabbits-sheeps system. The picture on the right says that most of the time one species 
drives the other to extinction. If the initial data for the system is a point on the blue region, 


called the rabbit basin, then the solution evolves in time toward the critical point a = Hl : 


This means that the sheep become extinct. If the initial data for the system is a point on 
the green region, called the sheep basin, then the solution evolves in time toward the critical 


point 2, = A . This means that the rabbits become extinct. 
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FIGURE 16. The linearizations of the rabbits-sheeps system in Eqs. (6.2.1)-(6.2.2). 


We now notice that all these critical points 
have nonzero real part, that means they 
are hyperbolic critical points. Then we can 
use Hartman-Grobman Theorem 6.2.8 to con- 
struct the phase portrait of the nonlinear sys- 
tem in (6.2.1)-(6.2.2) around these critical 
points. The Hartman-Grobman theorem says 
that the qualitative structure of the phase por- 
trait for the linearized system is the same for 
the phase portrait of the nonlinear system 
around the critical point. So we get the pic- 
ture in Fig. 17. 


a. 


é 
“ a ey, 


FIGURE 18. The phase portrait of the rabbits-sheeps system in Eqs. (6.2.1)-(6.2.2). 


[2] 


alfracter [#] sedevle 


FIGURE 17. Phase Portrait 
for Eqs. (6.2.1)-(6.2.2). 
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The two basins of attractions are separated by a curve, called the basin boundary. Only 
when the initial data lies on that curve the rabbits and sheeps coexist with neither becoming 


F : a : 1 : 
extinct. The solution moves towards the critical point x, = Hl . Therefore, the populations 


of rabbits and sheep become equal to each other as time goes to infinity. But, if we pick 
an initial data outside this basin boundary, no matter how close this boundary, one of the 


species becomes extinct. 
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6.2.6. Exercises. 


6.2.1.- * Consider the autonomous system 


xv =a(l—a2—-y) 


,_ (3 1 

v=o(}-0-d) 

(1) Find the critical points of the 
system above. 

(2) Find the linearization at each 
of the critical points above. 

(3) Classify the critical points 
above as attractors, repellers, 
or saddle points. 

(4) Sketch a qualitative phase 
portrait of the solutions of the 
system above. 


EMS AND STABILITY 


6.2.2.- . 


CHAPTER 7 


Boundary Value Problems 


We study the a simple case of the Sturm-Liouville Problem, we then present how to compute 
the Fourier series expansion of continuous and discontinuous functions. We end this chapter 
introducing the separation of variables method to find solutions of a partial differential 
equation, the heat equation. 


Insulation 


u(t,0) = 0 


Zz Insulation 
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7.1. Eigenfunction Problems 


In this Section we consider second order, linear, ordinary differential equations. In the 
first half of the Section we study boundary value problems for these equations and in the 
second half we focus on a particular type of boundary value problems, called the eigenvalue- 
eigenfunction problem for these equations. 


7.1.1. Two-Point Boundary Value Problems. We start with the definition of a 
two-point boundary value problem. 
Definition 7.1.1. A two-point boundary value problem (BVP) is the following: Find 
solutions to the differential equation 
y" + a,(x) y! + ao(x) y = O(a) 

satisfying the boundary conditions (BC) 

br y(a1) + bo y'(a1) = 1, 

by y(X2) + be y' (2) = Yr, 


where b,, bo, by, bo, X14, Lo, Y1, and Y2 are given and x, # x2. The boundary conditions are 
homogeneous iff y, =0 and y, = 0 


Remarks: 


(a) The two boundary conditions are held at different points, x, 4 Xp. 
(b) Both y and y’ may appear in the boundary condition. 


Example 7.1.1. We now show four examples of boundary value problems that differ only 
on the boundary conditions: Solve the different equation 


y" a ay y ai ay = e 2! 
with the boundary conditions at x, = 0 and x, = 1 given below. 


(a) 


y(0) = M1; by = 1, bs = 0, 
Boundary Condition: which is the case 7 7 
y(1) = Ya, by = 1, be = 0. 
(b) 
y(0) =%, by =1, b =0, 
Boundary Condition: ' which is the case i ‘ 
y 1) = %, b, = 0, bo = 1. 
(c) 
y (0) =, b,=0, b& =1, 
Boundary Condition: which is the case es és 
y(1) = %, b, =1, b, = 0. 
(d) 
y 0=n, b, = 0, b, = 1, 
Boundary Condition: : which is the case i . 
y 1) = %, b, = 0, bo = 1. 
(ec) 
2 0 + ‘ 0 — ; b = 2, b = 1, 
BC: a ) a )=% which is the case - ‘i 
y'(1) + 3y"(1) = ye, i= 1, ty =: 
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7.1.2. Comparison: IVP and BVP. We now review the initial boundary value 
problem for the equation above, which was discussed in Sect. 2.1, where we showed in 
Theorem 2.1.2 that this initial value problem always has a unique solution. 


Definition 7.1.2 (IVP). Find all solutions of the differential equation y” +.a,y' +a. y = 0 
satisfying the initial condition (IC) 


y(to) = Yo, y(t) = M1. (7.1.1) 


Remarks: In an initial value problem we usually the following happens. 


e The variable t represents time. 
e The variable y represents position. 
e The IC are position and velocity at the initial time. 


A typical boundary value problem that appears in many applications is the following. 


Definition 7.1.3 (BVP). Find all solutions of the differential equation y” +a, y' +a). y = 0 
satisfying the boundary condition (BC) 


yO)=y, yL)=m%, LAO. (71,2) 


Remarks: In a boundary value problem we usually the following happens. 


e The variable x represents position. 
e The variable y may represents a physical quantity such us temperature. 
e The BC are the temperature at two different positions. 


The names “initial value problem” and “boundary value problem” come from physics. 
An example of the former is to solve Newton’s equations of motion for the position function 
of a point particle that starts at a given initial position and velocity. An example of the 
latter is to find the equilibrium temperature of a cylindrical bar with thermal insulation on 
the round surface and held at constant temperatures at the top and bottom sides. 

Let’s recall an important result we saw in § 2.1 about solutions to initial value problems. 


Theorem 7.1.4 (IVP). The equation y" +a, y' +a y = 0 with IC y(to) = yo and y' (to) = ys 
has a unique solution y for each choice of the IC. 


The solutions to boundary value problems are more complicated to describe. A bound- 
ary value problem may have a unique solution, or may have infinitely many solutions, or 
may have no solution, depending on the boundary conditions. In the case of the boundary 
value problem in Def. 7.1.3 we get the following. 


Theorem 7.1.5 (BVP). The equation y" +a, y’ +a y = 0 with BC y(0) = yo and y(L) = m1, 

wilt L £0 and with r+ roots of the characteristic polynomial p(r) = r? + air + do, satisfy 

the following. 

(A) Ifr, Ar- are reals, then the BVP above has a unique solution for all yo, y: € R. 

(B) Ifrs =a+iP are complez, with a, B € R, then the solution of the BVP above belongs 
to one of the following three possibilities: 

(i) There exists a unique solution; 

(it) There exists infinitely many solutions; 
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(iti) There exists no solution. 


Proof of Theorem 7.1.5: 
Part (A): If r, 4 r- are reals, then the general solution of the differential equation is 


y(“z) =cae™** +c.e", 


The boundary conditions are 


Y =y(0) =a te. m | 1 1 | ¢ - [2 
Y= y(L) =G eck +e ect enh eb C. Ys : 


This system for c., c- has a unique solution iff the coefficient matrix is invertible. But its 
determinant is 


Therefore, if the roots r, 4 r_ are reals, then e’-’ 4 e’™”, hence there is a unique solution 
c+, C-, which in turn fixes a unique solution y of the BVP. 

In the case that r, = r- = ro, then we have to start over, since the general solution of 
the differential equation is 


y(x) = (cy + ce) x)”, C1,C, ER. 


Again, the boundary conditions in Eq. (7.1.2) determine the values of the constants c, and 
C, as follows: 


Yo — y(0) = Cy 1 0 Ct Yo 
> L roL — . 
yn = y(L) = ce" + Le eo" Le Co Vs 
This system for c,;, c. has a unique solution iff the coefficient matrix is invertible. But its 
determinant is 


1 0 


==> roL 
enol Lert =Le 


So, for L ~ 0 the determinant above is nonzero, then there is a unique solution c,, ¢., which 
in turn fixes a unique solution y of the BVP. 
Part (B): If rs =a+if8, that is complex, then 


= el#iA)l = eo! (cos(BL) + isin(BL)), 


therefore 
e” © — e™" = @°" (cos(BL) — isin(8L) — cos(BL) — isin(BL)) 
= —2ie* sin(BL). 
We conclude that 
el — eM) — 26% sin(BL)=0 & BL=nn. 


So for BL 4 nz the BVP has a unique solution, case (Bi). But for 6Z = nz the BVP has 
either no solution or infinitely many solutions, cases (Bii) and (Biii). This establishes the 
Theorem. 


Example 7.1.2. Find all solutions to the BVPs y” + y = 0 with the BCs: 


y(0) = 1, y(0) =1, y(0) =1, 
(2) fee =p, ee ag. fee =-1. 


Solution: We first find the roots of the characteristic polynomial r? + 1 = 
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r4 = +1. So the general solution of the differential equation is 

y(x) = c, cos(x) + ce, sin(z). 
BC (a): 

1=y(0)=¢ gq =1 

0= y(t) =-cG c, = 0 

Therefore, there is no solution. 
BC (b): 

1=y(0)=«¢, G1 


So there is a unique solution y(«) = 


l=y(n/2)=Q > GQeH=l. 
os(a#) + sin(«). 


BC (c): 


1=y(0)=«¢, CoS. 


-l=y(m)=-Y GQ=l. 


Therefore, c, is arbitrary, so we have infinitely many solutions 


Example 7.1.3. Find all solutions to the BVPs y+ 4y = 0 with the BCs: 


y(0) =1, y(0) = 1, 
- ee =-1. (b) ee ae; (c) iy 


Solution: We first find the roots of the characteristic polynomial r? + 4 = 


r 


y(x) = cos(x) + ~ sin(z), co ER. 


+27. So the general solution of the differential equation is 


y(x) = c, cos(2x) + c, sin(2z). 


BC (a): 


1=y(0)=¢ c, = 1. 


-l=y(n/4)=Q => GQ=-l. 


Therefore, there is a unique solution y(a) = cos(2x) — sin(2z). 
BC (b): 


1=y(0)=¢ c, = 1. 
-1l=y(n/2) =-c cq =. 


So, c, is arbitrary and we have infinitely many solutions 


y(x) = cos(2x) + c sin(2z), CG ER. 


BC (c): 


1=y(0)=¢ c, = 1. 
= y(7/2) = -c, CG =-1. 


Therefore, we have no solution. 
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0, that is, 


0, that is, 
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7.1.3. Eigenfunction Problems. We now focus on boundary value problems that 
have infinitely many solutions. A particular type of these problems are called an eigenfunc- 
tion problems. They are similar to the eigenvector problems we studied in § 8.3. Recall that 
the eigenvector problem is the following: Given an n x n matrix A, find all numbers A and 
nonzero vectors v solution of the algebraic linear system 


Av= uv. 
We saw that for each A there are infinitely many solutions v, because if v is a solution so is 


any multiple av. An eigenfunction problem is something similar. 


Definition 7.1.6. An eigenfunction problem is the following: Given a linear operator 
L(y) =y" +a,y' + aoy, find a number A and a nonzero function y solution of 


with homogeneous boundary conditions 


by y(a1) + be y' (#1) =0 
b, y(v) + by y' (x2) = 0. 


Remarks: 


Notice that y = 0 is always a solution of the BVP above. 

Eigenfunctions are the nonzero solutions of the BVP above. 

Hence, the eigenfunction problem is a BVP with infinitely many solutions. 

So, we look for \ such that the operator L(y) + Ay has characteristic polynomial 
with complex roots. 

e So, » is such that L(y) + Ay has oscillatory solutions. 

e Our examples focus on the linear operator L(y) = y”. 


Example 7.1.4. Find all numbers \ and nonzero functions y solutions of the BVP 


y’+Ay=0, with y(0)=0, y(L)=0, L>0. 


Solution: We divide the problem in three cases: (a) A < 0, (b) A = 0, and (c) A> 0. 
Case (a): \ = —p? <0, so the equation is y’ — u?y = 0. The characteristic equation is 


r? — p? =0 TS Ep: 
The general solution is y = c, e"* + c_e~#*. The BC imply 
0=y(0)=at+e, O=y¥(L) =caet@ tee, 
So from the first equation we get c, = —c_, so 
O=-cet¥t+ee#h > -c(et’—e#*)=0 > « =0, « =0. 


So the only the solution is y = 0, then there are no eigenfunctions with negative eigenvalues. 
Case (b): \ = 0, so the differential equation is 


y=0 > y=ot+eye. 
The BC imply 


0 = y(0) = cp, 0=y(L) =aL Cc, = 0. 
So the only solution is y = 0, then there are no eigenfunctions with eigenvalue \ = 0. 
Case (c): \ = 1? > 0, so the equation is y” + y?2y = 0. The characteristic equation is 


r? +p? =0 rT, = Ey. 
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The general solution is y = c, cos(jx) + c- sin(ux). The BC imply 
0=y(0) =a, 0 = y(L) = c, cos(uL) + c-sin(uL). 
Since c, = 0, the second equation above is 
c-sin(wL) = 0, c #0 =) sin(ul)=0 >  ppL=nn. 


So we get fin = n7/L, hence the eigenvalue eigenfunction pairs are 
2 
An = (+) ‘ Yn(x) = Cn sin(“*). 
L 
Since we need only one eigenfunction for each eigenvalue, we choose c,, = 1, and we get 
2 
n= (S), une) =sin(*), nd. 
L 
dq 
Example 7.1.5. Find the numbers 4 and the nonzero functions y solutions of the BVP 
y’ +rAy = 0, y(0) = 0, y Ls =0, L> 0. 


Solution: We divide the problem in three cases: (a) A < 0, (b) A = 0, and (c) A> 0. 
Case (a): Let A = —p/?, with p > 0, so the equation is y” — 2 y = 0. The characteristic 
equation is 


r?— yw? =0 Ts = =H, 
The general solution is y(x) = c,e~#* + ce”. The BC imply 


0= y(0) =C+ Co, | 1 1 a 4 
/ eh L > a i = . 
O=y(£) =—yue,e *" + pce” pe pe Co 0 


The matrix above is invertible, because 


1 1 
—pe BE pet 


= p(ehh + en) #0. 


So, the linear system above for c,, c. has a unique solution c,; = c, = 0. Hence, we get the 
only solution y = 0. This means there are no eigenfunctions with negative eigenvalues. 
Case (b): Let \ = 0, so the differential equation is 


y’=0 = y(t) =o4+ E2, 1,6, ER. 
The boundary conditions imply the following conditions on c, and cp, 
0=y(0) =a, 0=y'(L) =e. 


So the only solution is y = 0. This means there are no eigenfunctions with eigenvalue \ = 0. 
Case (c): Let A = p?, with > 0, so the equation is y” + yu? y = 0. The characteristic 
equation is 


r?+w?=0 Tr, = Epi. 
The general solution is y(x) = c; cos(jux) + c. sin(ux). The BC imply 


0 = y(0) = C1; 


=> ccos(uL) =0. 
0=y'(L) = —pe, sin(wL) + pce na meee) 


Since we are interested in non-zero solutions y, we look for solutions with c. 4 0. This 
implies that ~ cannot be arbitrary but must satisfy the equation 
T 


cos(uL)=0 & pL = (2n-1) 5 


3 
W 
a 
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We therefore conclude that the eigenvalues and eigenfunctions are given by 


2n —1)?2n? 2n—-1 
py = un () = ey sin( OP Uae), ni. 
Since we only need one eigenfunction for each eigenvalue, we choose c,, = 1, and we get 
2n —1)2n? 2n-1 
dy = yn(a) = sin(! —), n21. 


dq 


Example 7.1.6. Find the numbers 4 and the nonzero functions y solutions of the BVP 
gy” —axy' =—dry, y(1)=0, y(@)=0, &>1. 
Solution: Let us rewrite the equation as 
xy” —axy' +rA(y = 0. 
This is an Euler equidimensional equation. From § 2.4 we know we need to look for the 
solutions r, of the indicial polynomial 


r(r—1)—r+A=0 r?—Ir+A=0 re =14vV1-.. 


Case (a): Let 1— \ = 0, so we have a repeated root r, = r_ = 1. The general solution to 
the differential equation is 
y(a) = (e, + & In(z)) x. 
The boundary conditions imply the following conditions on c, and cy, 
0=y(l) =a, 
0=y(0) = (a +eln(Q)) £ 


So the only solution is y = 0. This means there are no eigenfunctions with eigenvalue \ = 1. 


> olIn()=0 > G=0. 


Case (b): Let 1— > 0, so we can rewrite it as 1— A = w?, with » > 0. Then, re =1+yp, 
and so the general solution to the differential equation is given by 


y(x) = 29) + eat), 


The boundary conditions imply the following conditions on c, and c, 


0=y(1)=a+e, 1 1 c} — {0 
0=y(0) =c00-) 4,004 ( 7 [et e040] Je] = [of 


The matrix above is invertible, because 


1 1 
ea-n) e+) 


=e (0-0) £0 e#+H1. 


Since £ > 1, the matrix above is invertible, and the linear system for c,, c. has a unique 
solution given by c, = c, = 0. Hence we get the only solution y = 0. This means there are 
no eigenfunctions with eigenvalues \ < 1. 


Case (c): Let 1—. < 0, so we can rewrite it as 1—A = —p?, with p > 0. Then rz = 1+ip, 
and so the general solution to the differential equation is 


y(x) = x[c, cos(juIn(x)) + ce sin(uIn(z))]. 
The boundary conditions imply the following conditions on c, and cp, 
0=y(1) =a, 
0 = y(4) = c,€cos(pIn(0)) + c€sin((uIn(0)) 


=> osin(uln(é)) =0. 
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Since we are interested in nonzero solutions y, we look for solutions with c. # 0. This 
implies that ~ cannot be arbitrary but must satisfy the equation 
sin(win(@)) =0 = ppln(é) =nz, n>1. 
Recalling that 1 — A, = —y2, we get An = 1+ 2, hence, 


nx? nr In(a) 
An =1+-3-—, in (2 = cn sin( 7), n>. 
mo — int) In(0) 
Since we only need one eigenfunction for each eigenvalue, we choose c,, = 1, and we get 
na nt In(a) 
An =1+-—53-—; Un (2 =2sin( =), nee Us 
mp? — in) In(0) 
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7.1.4. Exercises. 


7.1.1.- . 7.1.2.- . 
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7.2. Overview of Fourier series 


A vector in three dimensional space can be decomposed as a linear combination of its com- 
ponents in a vector basis. If the basis vectors are all mutually perpendicular—an orthogonal 
basis—then there is a simple formula for the vector components. This is the Fourier expan- 
sion theorem for vectors in space. In this section we generalize these ideas from the three 
dimensional space to the infinite dimensional space of continuous functions. We introduce 
a notion of orthogonality among functions and we choose a particular orthogonal basis in 
this space. Then we state that any continuous function can be decomposed as a linear 
combination of its components in that orthogonal basis. This is the Fourier series expansion 
theorem for continuous functions. 


7.2.1. Fourier Expansion of Vectors. We review the basic concepts about vectors 
in R®° we will need to generalize to the space of functions. These concepts include: the 
dot (or inner) product of two vectors, orthogonal and orthonormal set of vectors, Fourier 
expansion (or orthonormal expansion) of vectors, and vector approximations. 


Definition 7.2.1. The dot product of two vectors u, v € R? is 
u-u= |u||v| cos(A), 


with |u|, |v| the magnitude of the vectors, and 6 € [0,7] the angle in between them. 


The magnitude of a vector u can be written as 
” |u| = /u-u. 


A vector wis a unit vector iff 


z 


—— u-u=i. 
Any vector v can be rescaled into a unit vector by di- 
y viding by its magnitude. So, the vector u below is a 
unit vector, 
v 
x “u= —. 
|v| 
The dot product tries to capture the notion of projection of one vector onto another. 
In the case that one of the vectors is a unit vector, the dot product is exactly the projection 
of the second vector onto the unit vector, 


v- u= |v|cos(6), for |u| =1. 
The dot product above satisfies the following properties. 
Theorem 7.2.2. For every u, v, w € R? and every a, b € R holds, 
(a) Positivity: u-u=0 iffu=0; and u-u>0 for cbuF¢ 0. 


(b) Symmetry: wu. v=v-u. 
(c) Linearity: (au+ bv). w=a(u-w)+b(v- w). 


When two vectors are perpendicular—no projection of one onto the other—their dot 
product vanishes. 


Theorem 7.2.3. The vectors u, v are orthogonal (perpendicular) iff u- v= 0. 


A set of vectors is an orthogonal set if all the vectors in the set are mutually perpendic- 
ular. An orthonormal set is an orthogonal set where all the vectors are unit vectors. 
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Example 7.2.1. The set of vectors {%,7,k} used in physics is an orthonormal set in R?. 


Solution: These are the vectors 
1 


As we can see in Fig. 1, they are mutually perpendic- 
ular, and have unit magnitude, that is, 


eye: tisk 
Ph IeRei, 


FIGURE 1. Vectors 2,7 ,k. 


The Fourier expansion theorem says that the set above is not just a set, it is a basis—any 
vector in R? can be decomposed as a linear combination of the basis vectors. Furthermore, 
there is a simple formula for the vector components. 


Theorem 7.2.4. The orthonormal set {i, 7, k} is an orthonormal basis, that is, every vector 
v € R® can be decomposed as 

V= Vz t+ Vy J+ v, k. 
The orthonormality of the vector set implies a formula for the vector components 


Uz = V+ 4, Vy = U-j, vz =v-k. 


The vector components are the dot product of the whole vector with each basis vector. 
The decomposition above allow us to introduce vector approximations. 


The Fourier expansion of a vector in an 
orthonormal basis allows us to introduce 
vector approximations. We just cut the 
Fourier expansion at the first, second, or 
third term: 


wv) =v, 1, 


y?) = Vz tt Vy J, 


y) = Vz tt Vyj +z k. 


Such vector approximations are silly to 
do in three dimensional space. But they 
can be useful if we work in a large di- 
mensional space. And they become an 
essential tool when we work in an infinite 
FIGURE 2. Fourier expansion of a dimensional space, such as the space of 
vector in an orthonormal basis. continuous functions. 
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7.2.2. Fourier Expansion of Functions. The ideas described above for vectors in 
R® can be extended to functions. We start introducing a notion of projection—hence of per- 
pendicularity—among functions. Unlike it happens in R*, we now do not have a geometric 
intuition that can help us find such a product. So we look for any dot product of functions 
having the positivity property, the symmetry property, and the linearity property. Here is 
one product with these properties. 


Definition 7.2.5. The dot product of two functions f, g on |[—L, LI] is 


Lb 
feo=f se)o(e)ae 


The dot product above takes two functions and produces a number. And one can verify 
that the product has the following properties. 
Theorem 7.2.6. For every functions f, g, h and every a, b € R holds, 


(a) Positivity: f- f =0 iff f =0; and f- f >0 for f £0. 
(b) Symmetry: f-g= g-f. 
(c) Linearity: (af +bg)-h=a(f-h)+b(g-h). 


Remarks: The magnitude of a function f is the nonnegative number 


lfll=VF-F = (Kio 


We use a double bar to denote the magnitude so we do not confuse it with |f|, which means 
the absolute value. A function f is a unit function iff f-. f =1. 


Since we do not have a geometric intuition for perpendicular functions, we need to define 
such a notion on functions using the dot product. Therefore, the following statement for 
functions is a definition, unlike for vectors in space where it is a theorem. 


Definition 7.2.7. Two functions f, g are orthogonal (perpendicular) iff f-g = 0. 


A set of functions is an orthogonal set if all the functions in the set are mutually perpen- 
dicular. An orthonormal set is an orthogonal set where all the functions are unit functions. 


Theorem 7.2.8. An orthonormal set in the space of continuous functions on [—L, L] is 
fa ih 7 1 ee . 1. eo 
Up = — =, ty = —cos| —], 0, = —=sin| — ; 
. J2L af Li L VL L n=1 


Remark: To show that the set above is orthogonal we need to show that the dot product 
of any two different functions in the set vanishes—the three equations below on the left. To 
show that the set if orthonormal we also need to show that all the functions in the set are 
unit functions—the two equations below on the right. 

tim: Un =0, men. Un*Un=1, for alln. 

Om:+Un =0, mn. Un + Un =1, for all n. 


Um + Un =0, for all m,n. 
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Example 7.2.2. The normalization condition is simple to see, because for n > 1 holds 


= 4 


2 
NT2x 1 ("=") 1 y. (ae) 1 

cos COs dz = = cos* | —])dx=—L=1. 
_trVvL ( L a L Lj_y, L L 


Un * Un = 
dj 


The orthogonality of the set above is equivalent to the following statement about the 
functions sine and cosine. 


Theorem 7.2.9. The following relations hold for alln, m EN, 


Proof of Theorem 7.2.9: Just recall the following trigonometric identities: 
cos(@) cos(¢) = ; [cos(0 + d) + cos(6 — ¢)], 
sin(@) sin(¢) = ; [cos(@ — d) — cos(6 + ¢)], 
gin(#) abatey = ; [sin(6 + ) + sin(6 — ¢)]. 


So, From the trigonometric identities above we obtain 
L L ( 


J. cos(“*) cos (™*") dz = a cos |" tm)re) dx + ; I. cos) amr) 


First, assume n > 0 or m > 0, then the first term vanishes, since 
a L L 
= cos (ie a ua dx = sin ican uous | 
2 L L —L 


= 0. 
_L 2(n+m)r 


Still for n > 0 or m > 0, assume that n 4 m, then the second term above is 


L n—-— Mm)rx 7 Mr — ™M)TL 
5 feos cn iw me sin! L I, rar 


Again, still for n > 0 or m > 0, assume that n = m 4 0, then 


iy (n — m)rax 1. 7 
5 [| Z Jac=5f de=t. 


Finally, in the case that both n = m = 0 is simple to see that 


L L 
|, °(") cos(“™*) dv= de =2L. 


The remaining equations in the Theorem are proven in a similar way. This establishes the 
Theorem. 
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Remark: Instead of an orthonormal set we will use an orthogonal set, which is often used 
in the literature on Fourier series: 


Theorem 7.2.10 (Fourier Expansion). The orthogonal set 


1 NTL NAL, ) 
= =, Un, = cos( — in = (— 2.1 
{Wt os cos ( Fa ), " ( L Vhs aa) 


is an orthogonal basis of the space of continuous functions on |—L, L], that is, any continuous 
function on [—L,L] can be decomposed as 


es = + are cos("**) + bp, sin(“"*)). 
n=l 


Moreover, the coefficients above are given by the formulas 


te = bf sea 
5, = ai f(x) cos(“**) dx, 
bn = a8 f(a) sin(*) a 


Furthermore, if f is piecewise continuous, then the function 


fr(t) = © + 5° (an 008") +b, sin(™™)) 
n=1 


satisfies f(x) = f(x) for alla where f is continuous, while for all zo where f is discontin- 
uous it holds 
i 


fr(2o) 5) 


(lim, f(e) + lim_f(x)) 


I+Xy 


Idea of the Proof of Theorem 7.2.10: It is not simple to prove that the set in 7.2.1 is 
a basis, that is every continuous function on [—L, L] can be written as a linear combination 


fla) = % + 3° (an c05("™) + bn sin(™**)). 
n=1 


We skip that part of the proof. But once we have the expansion above, it is not difficult to 
find a formula for the coefficients ao, ay, and by, for n > 1. To find a coefficient b,, we just 
multiply the expansion above by sin(mza/L) and integrate on [—L, L], that is, 


f(z) -sin(“**) - (= +O cos(“**) + by sin(“**))) -sin(“**). 
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The linearity property of the dot product implies 
f(a): sin(“™**) = io ; . sin(™**) 
= NTx . (mre 
+ Dan coe“) -en(“F) 
5 b sin( om . (MTX 
Sob sin(“22) -sin( 222), 


But (1/2)-sin(maa/L) = 0, since the sine functions above are perpendicular to the constant 
functions. Also cos(naa/L) - sin(maa/L) = 0, since all sine functions above are perpendic- 
ular to all cosine functions above. Finally sin(nax/L) - sin(maxz/L) = 0 for m 4 n, since 
sine functions with different values of m and n are mutually perpendicular. So, on the right 
hand side above it survives only one term, n = m, 

a sn(2)=g ™2) (2). 
But on the right hand side we got the magnitude square of the sine function above, 


; (“22 : (™22) = : (“E\ |" = 2 
sin T sin 7 = |/sin 7 =f. 


Therefore, 
. (Mnrx 1 Zh . (Mrex 
f(x) : sin(““**) = bm L > bm = L /. f(x) sin(““**) dz. 


To get the coefficient a,,, multiply the series expansion of f by cos(mmx/L) and integrate 
on [—L, L], that is 


MTL) _ (Ao oS NTx _ (NT Manx 
f(a): cos(“**) — ( 5 + 2 (™ cos(“T ) + by sin(“* ))) cos (“EE ?. 
As before, the linearity of the dot product together with the orthogonality properties of the 
basis implies that only one term survives, 


f( ) 5 (a) _ (=) Ps (7) 

x OS = Am COS L OSs A Z 
(FE) -em(7E) = los“) = 4 

cos cos 7 => S ip =h, 


f(a): cos(*"*) =AmrbL > am= ; a f(x) cos( =) dx. 


The coefficient a> is obtained integrating on [—L, L] the series expansion for f, and using 
that all sine and cosine functions above are perpendicular to the constant functions, then 


we get 
L L 
/ f(a),da = f de = 9F, 
aT, 2 Jip 2 


L 
ao = z |, fovae. 


We also skip the part of the proof about the values of the Fourier series of discontinuous 
functions at the point of the discontinuity. 


Since 


we get that 


so we get the formula 
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We now use the formulas in the Theorem above to compute the Fourier series expansion 
of a continuous function. 


for «x € [0,3] 


x 
Example 7.2.3. Find the Fourier expansion of f(a) = 2 
0, for «€ [-3,0). 


Solution: The Fourier expansion of f is 


fr(x) = 7 po cos ("**) + Dy, sin(“**) 


In our case L = 3. We start computing b,, for n > 1, 
3 
rl f(x)sin(2*) dx 
3 3 
3 
= | : sin(“**) dx 
3 Jo 3 3 


“( 3x (222) 4 9 in(*22) | 
9\ nr ne 3 nen ss 3 0 


- 5 (—~ cos(nn) +0 +0-0), 


1 
bn = = 
3 
1 


therefore we get 


A similar calculation gives us a, = 0 for n > 1, 
1 3 
An = ae f(z) cos(“7*) da 
_ 1 [ x ("=") 4 
—3 4,3 cos 3 x 


_ (= ? (“=*) ' 9 (“**))/ 
~ O\nt ear 3 ' 272 ea 3 ny 
1 


therefore we get 


Finally, we compute do, 


Therefore, we get 
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7.2.3. Even or Odd Functions. The Fourier series expansion of a function takes a 
simpler form in case the function is either even or odd. More interestingly, given a function 
on [0, Z] one can extend such function to [—L, L] requiring that the extension be either even 
or odd. 


Definition 7.2.11. A function f on [—L, L] is: 


e even iff f(—x) = f(x) for all x € [-L, L}; 
e odd iff f(—x) = —f(x) for all x € [-L, L}. 


Remark: Not every function is either odd or even. The function y = e*” is neither even nor 
odd. And in the case that a function is even, such as y = cos(x), or odd, such as y = sin(), 
it is very simple to break that symmetry: add a constant. The functions y = 1+ cos(x) and 
y = 1+sin(z) are neither even nor odd. 


Below we now show that the graph of a typical even function is symmetrical about the 
vertical axis, while the graph of a typical odd function is symmetrical about the origin. 


Example 7.2.4. The function y = x? is even, while the function y = 2° is odd. 


FI E4. y=2°i : 
FIGURE 3. y= a2 is even. GUR y x° is odd 


We now summarize a few property of even functions and odd functions. 


Theorem 7.2.12. If fe, ge are even and hy, £4 are odd functions, then: 


(1) afe+bge is even for alla, bER. 
(2) aho +b, is odd for alla, bER. 
(3) fege is even. 
(4) hoo 18 even. 
(5) felo is odd. 


(6) [i teav=2 f foae. 


(7) [ roae=o. 


7.2. OVERVIEW OF FOURIER SERIES 321 


Remark: We leave proof as an exercise. Notice that the last two equations above are simple 
to understand, just by looking at the figures below. 


FIGURE 5. Integral of an FIGURE 6. Integral of an 
even function. odd function. 


7.2.4. Sine and Cosine Series. In the case that a function is either even or odd, half 
of its Fourier series expansion coefficients vanish. In this case the Fourier series is called 
either a sine or a cosine series. 


Theorem 7.2.13. Let f be a function on [—L, L| with a Fourier expansion 
F(a) = + Yo foncos("") +bysin("7)) 
(a) If f is even, then b, =0. The series Fourier series is called a cosine series, 
fa)= = + yo cos("**). 
(b) If f is odd, then a, =0. The series Fourier series is called a sine series, 


fiz) = > Dey sin(“**). 


n=1 


Proof of Theorem 7.2.13: 
Part (a): Suppose that f is even, then for n > 1 we get 


bn tf f(x) sin(“**) dx, 


but f is even and the Sine is odd, so the integrand is odd. Therefore b, = 0. 
Part (b): Suppose that f is odd, then for n > 1 we get 


An = ef. f(x) cos(“**) dx, 


but f is odd and the Cosine is even, so the integrand is odd. Therefore a, = 0. Finally 


L 
a= z |, fede, 
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but f is odd, hence ay = 0. This establishes the Theorem. 


1, for «€ [0,3] 


Example 7.2.5. Find the Fourier expansion of f(x) = 1. fo € [-3,0) 
—1, for «x € [—3,0). 


Solution: The function f is odd, so its Fourier series expansion 


a) 


Ao == NTL) | . 
fe(@) =F + an cos L ) + by sin( Z 


is actually a sine series. Therefore, all the coefficients a, = 0 for n > 0. So we only need to 
compute the coefficients b,. Since in our case LD = 3, we have 


bn = if. f(a) sin(*) dx 


° nTe 3 opnna 
=3(/ 1) sin( 3 ) dex | sin(“**) dx) 


= = (-(-1)"+1) > b= = ((-1)+) +1). 


Therefore, we get 


fr(zt)= > 2. (alr! a1) sin(“™*). 


Example 7.2.6. Find the Fourier series expansion of the function 


x «€ (0,1), 
f(z) = —x « €[-1,0). 


Solution: Since f is even, then b, = 0. And since L = 1, we get 


f(x) = = + ys Gy cos(nr2), 


n=1 


We start with a). Since f is even, dp is given by 


1 1 21 
x 
ay =2 | flv) de=2 f xde = 2=| > a=l. 
0 0 2 Io 
Now we compute the a,, for n > 1. Since f and the cosines are even, so is their product, 
1 
An = 2 | x cos(nma) da 
0 
Gs 1 1 
= 2(= sin(ntz) + cos(nz)) | 
NT NT 0 


= — (cos(n7) - 1) > @=—> iar _ 1). 
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So, 


f(x) = ; + Lm = ((—1)" — 1) cos(nmz). 


n=1 


Example 7.2.7. Find the Fourier series expansion of the function 


l-« veE(0,]] 
f(z) = l+a a2 €[-1,0). 


Solution: Since f is even, then b, = 0. And since L = 1, we get 


f(x) = > + S- Ap, cos(nTx), 


n=1 


We start computing ao, 


= [ (+ade+ [ 0-2)ae 


xe 19 a-\ 1 
7 aie (2 wi 


| 
an 


Similarly, 
1 
Gn =i f(x) cos(nra) dx 
-1 
0 1 
=) (1+ 2) cos(naa) dx +f (1 — 2x) cos(nra) dx. 
-1 0 
Recalling the integrals 
1 
J cos(nza) dx = — sin(nra), 


nT 


fe cos(nra) dx = = sin(nma) + cos(nrx), 


nt ne 
it is not difficult to see that 


it 0 xz 0 

Qn = = sin(nre)| + |= sin(n7tax) + 7272 cos(ne)| , 
1 1 x 1 

+ — sin(na)| - [= sin(ntz) + ——> cos(nz)| | 
NT 0 nT NAT 0 


1 1 1 1 
~ Fe nn? aig n)| [a conn) ra 


we then conclude that 


an = 1 —cos(—n)] = 3 (1 — (-1)”). 


ne nn 
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So, 


f(z) = ; at E> 2 5(1—(-1)") cos(nrz). 


J 


7.2.5. Applications. The Fourier series expansion is a powerful tool for signal anal- 
ysis. It allows us to view any signal in a different way, where several difficult problems are 
very simple to solve. Take sound, for example. Sounds can be transformed into electri- 
cal currents by a microphone. There are electric circuits that compute the Fourier series 
expansion of the currents, and the result is the frequencies and their corresponding ampli- 
tude present on that signal. Then it is possible to manipulate a precise frequency and then 
recombine the result into a current. That current is transformed into sound by a speaker. 

This type of sound manipulation is very common. You might remember the annoying 
sound of the vuvuzelas—kind of loud trumpets, plastic made, very cheap—in the 2010 
soccer world championship. Their sound drowned the tv commentators during the world 
cup. But by the 2014 world cup you could see the vuvuzelas in the stadiums but you did 
not hear them. It turns out vuvuzelas produce a single frequency sound, about 235 Hz. The 
tv equipment had incorporated a circuit that eliminated that sound, just as we described 
above. Fourier series expand the sound, kill that annoying frequency, and recombine the 
sound. 

A similar, although more elaborate, sound manipulation is done constantly by sound 
editors in any film. Suppose you like an actor but you do not like his voice. You record 
the movie, then take the actor’s voice, compute its Fourier series expansion, increase the 
amplitudes of the frequencies you like, kill the frequencies you do not like, and recombine 
the resulting sound. Now the actor has a new voice in the movie. 


Fourier transform are used in image analysis 
too. A black and white image can be thought 
as a function from a rectangle into the set 
{0,1}, that is, pixel on or pixel off. We can 
now write this function of two variables as a 
linear combination of sine and cosine functions 
in two space dimensions. Then one can ma- 
nipulate the individual frequencies, enhancing 
some, decreasing others, and then recombine 
the result. In Fig. 7 we have an image and its 
Fourier transform. In Fig 8 and 9 we see the 
effect on the image when we erase high or low 
frequency modes. 


FiIGuRE 7. Animage and its 
Fourier transform. 
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FIGURE 8. Image after eras- FIGURE 9. Image after eras- 
ing high frequency terms. ing low frequency terms. 
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7.2.1.- . 
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7.3. The Heat Equation 


We now solve our first partial differential equation—the heat equation—which describes the 
temperature of a material as function of time and space. This is a partial differential equation 
because it contains partial derivatives of both time and space variables. We solve this 
equation using the separation of variables method, which transforms the partial differential 
equation into two sets of infinitely many ordinary differential equations. One set of ODEs 
are initial value problems, while the other set are eigenfunction problems. 

The Heat equation has infinitely many solutions. One can get a unique solution imposing 
appropriate boundary and initial conditions. We solve the heat equation for two types of 
boundary conditions, called Dirichlet and Neumann conditions. 


7.3.1. The Heat Equation (in One-Space Dim). We start introducing the heat 
equation, for simplicity, in one-space dimension. 


Definition 7.3.1. The heat equation in one-space dimension, for the function u depending 
ont and x is 


O,u(t,2) =kO2u(t,xz), for t€[0,cc), x € (0, LZ], 


where k > 0 is a constant and 0;, Ox are partial derivatives with respect to t and x. 


Remarks: 


u is the temperature of a solid material. 

t is a time coordinate, while x is a space coordinate. 

k > 0 is the heat conductivity, with units [k] = [2]*/[¢]. 

The partial differential equation above has infinitely many solutions. 

We look for solutions satisfying both boundary conditions and initial conditions. 


Insulation 


u(0,2) = f(x) * 
= Insulation 
FIGURE 11. Sketch of the 
FicuRE 10. A solid bar initial-boundary value prob- 
thermally insulated on the lem on the ta-plane. 


four blue sides. 


The heat equation contains partial derivatives with respect to time and space. Solving 
the equation means to do several integrations, which means we have a few arbitrary inte- 
gration constants. So the equation has infinitely many solutions. We are going to look for 
solutions that satisfy some additional conditions, known as boundary conditions and initial 
conditions. 


u(t,0) = 0, 


Boundary Conditions: it) 0: 


Initial Conditions: f 
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We are going to try to understand the qualitative behavior of the solutions to the heat 
equation before we start any detailed calculation. Recall that the heat equation is 


The meaning of the left and hand side of the equation is the following: 


How fast the temperature ‘ ' The concavity of the graph of u 
increases of decreases. eae 


in the variable x at a given time. 


Suppose that at a fixed time t > 0 the graph of the temperature u as function of x is given by 
Fig. 12. We assume that the boundary conditions are u(t,0) = T) = 0 and u(t, L) = Ty > 0. 
Then the temperature will evolve in time following the red arrows in that figure. 

The heat equation relates the time variation of the temperature, O,u, to the curvature 
of the function u in the x variable, 02u. In the regions where the function u is concave up, 
hence 02u > 0, the heat equation says that the tempreature must increase 0;u > 0. In the 
regions where the function wu is concave down, hence 0?u < 0, the heat equation says that 
the tempreature must decrease Qu < 0. 

Therefore, the heat equation tries to make the temperature along the material to vary 
the least possible that is consistent with the boundary conditions. In the case of the figure 
below, the temperature will try to get to the dashed line. 


Oa Oxu <0 ae 


0 t fixed L x 
FIGURE 12. Qualitative behavior of a solution to the heat equation. 


Before we start solving the heat equation we mention one generalizations and and a 
couple of similar equations. 


e The heat equation in three space dimensions is 
Oyu = k (O2ut+ Oru + A?u). 


The method we use in this section to solve the one-space dimensional equation can 
be generalized to solve the three-space dimensional equation. 
e The wave equation in three space dimensions is 


O?u = v? (Zu + Ou + 0?u). 


This equation describes how waves propagate in a medium. The constant v has 
units of velocity, and it is the wave speed. 
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e The Schrédinger equation of Quantum Mechanics is 


2 


5 (Ont + Oyu + Ozu) + V(t, x) u, 


where m is the mass of a particle and fi is the Planck constant divided by 27, while 


i? = —1. The solutions of this equation behave more like the solutions of the wave 


equation than the solutions of the heat equation. 


7.3.2. The IBVP: Dirichlet Conditions. We now find solutions of the one-space 
dimensional heat equation that satisfy a particular type of boundary conditions, called 
Dirichlet boundary conditions. These conditions fix the values of the temperature at two 
sides of the bar. 


Theorem 7.3.2. The boundary value problem for the one space dimensional heat equation, 
Ou = k O2u, BO. gh at aft) =a, 


where k > 0, L > 0 are constants, has infinitely many solutions 
= 2 NTx 
u(t, z) = do goer sin(“**), Cn ER. 


Furthermore, for every continuous function f on [0, L] satisfying f(0) = f(L) = 0, there is a 
unique solution u of the boundary value problem above that also satisfies the initial condition 
u(0,2) = f(z). 


This solution u is given by the expression above, where the coefficients cy, are 


= 2f f(a) sin(“*) dx. 


Remarks: This is an Initial-Boundary Value Problem (IBVP). The boundary conditions 
are called Dirichlet boundary conditions. The physical meaning of the initial-boundary 
conditions is simple. 

(a) The boundary conditions is to keep the temperature at the sides of the bar is constant. 
(b) The initial condition is the initial temperature on the whole bar. 


The proof of the IBVP above is based on the separation of variables method: 
(1) Look for simple solutions of the boundary value problem. 
(2) Any linear combination of simple solutions is also a solution. (Superposition.) 
(3) Determine the free constants with the initial condition. 
Proof of the Theorem 7.3.2: Look for simple solutions of the heat equation given by 


u(t, x) = v(t) w(a). 


So we look for solutions having the variables separated into two functions. Introduce this 
particular function in the heat equation, 
1 o(t)  w"(z) 
k v(t) w(x)’ 
where we used the notation t = du/dt and w’ = dw/dz. The separation of variables in the 
function u implies a separation of variables in the heat equation. The left hand side in the 
last equation above depends only on ¢ and the right hand side depends only on x. The only 
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possible solution is that both sides are equal the same constant, call it —A. So we end up 
with two equations 


1 * t 1 
eee and i a 
k v(t) w(x) 
The equation on the left is first order and simple to solve. The solution depends on 4, 
v(t) = ce, e ** Cy = vy(0). 


The second equation leads to an eigenfunction problem for w once boundary conditions are 
provided. These boundary conditions come from the heat equation boundary conditions, 


u(t, 0) = v(t) w(0) =0 for allt > 0 
u(t, L) = v(t) w(L) =0 for allt > i w(0) = w(L) = 0. 


So we need to solve the following BVP for w; 
w"+rAw =0, w(0) = w(L) = 0. 
This is an eigenfunction problem we solved § 7.1, the solution is 


2 
du = (=), wn(z) = sin(“*), eee 


Since we now know the values of A,,, we introduce them in vy, 
Un(t) = Cn grr, 
Therefore, we got a simple solution of the heat equation BVP, 


ie Tx 
Un(t, 2) = Cn eo h(E) sin(“*) 


a 


where n = 1,2,---. Since the boundary conditions for u,;, are homogeneous, then any linear 
combination of the solutions u,, is also a solution of the heat equation with homogenous 
boundary conditions. Hence the function 


(22) ; NTL 
>> Cyn € L ® sin ie 


is solution of the heat equation with eee Dirichlet boundary conditions. Here the 
Cn are arbitrary constants. Notice that at t = 0 we have 


- (Nx 
“) = d Cn, sin(“**) 
If we prescribe the c, we get a solution u that at t = 0 is given by the previous formula. Is 
it the converse true? The answer is “yes”. Given f(x) = u(0,2), where f(0) = f(L) = 0, 
we can find all the coefficients c,. Here is how: Given f on [0, Z], extend it to the domain 
[—L, L] as an odd function, 


foaa(x) = f(x) and foaa(—2) = —f (2), x € [0, L} 


Since f(0) = 0, we get that foaq is continuous on [—L,L]. So foaq has a Fourier series 
expansion. Since foaq is odd, the Fourier series is a sine series 


foaa(x -> br sin( a) 


and the coefficients are given by the eae 


=4f fraale )sin(“** a) =? f 40) si a( =) de. 
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Since foaa(x) = f(x) for x € [0, L], then c, = by. This establishes the Theorem. 


Example 7.3.1. Find the solution to the initial-boundary value problem 
40,u=O2u, t>0, x € [0,2], 


with initial and boundary conditions given by 


0 we [0, *), 
24 u(t, 0) = 0, 
IC: u(0,2)=45 we larg ls BC: i 2) =0. 
4 
0 «we (3), 


Solution: We look for simple solutions of the form u(t, x) = u(t) w(z), 


wl FO=00F5@ + SO-2_-y 


So, the equations for v and w are 
o(t) = = v(t), w' (2) +Aw(x) = 0. 
The solution for v depends on \, and is given by 
u(t) = cy e3t, Cy = v,(0). 
Next we turn to the the equation for w, and we solve the BVP 
w' (#4) +Aw(xz) =0, with BC w(0) = w(2) = 0. 


This is an eigenfunction problem for w and ». This problem has solution only for A > 0, 
since only in that case the characteristic polynomial has complex roots. Let \ = pi”, then 


pr) =r? +2 =0 re = pt. 


The general solution of the differential equation is 
Wn(x) = c, cos(px) + cy sin(ux). 


The first boundary conditions on w implies 


0=w(0) =a, w(x) = c, sin(px). 


The second boundary condition on w implies 
0 = w(2) =c sin(w2), co A#0, => sin(u2) =0. 
Then, Un2 = n7, that is, un = a Choosing c, = 1, we conclude, 


2 
An = (+) ; W(x) = sin(“"*), n=1,2,:--- 


Using the values of A, found above in the formula for v, we get 
Un(t) = cn ew *F)*, ey, = n(0). 


Therefore, we get 


u(t, x) = » Cn eR) sin(*). 
n=1 
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The initial condition is 


0 2 (0,4), 
f(x) =u(0,2) = <5 ze [Sc], 
0 ne (5,2] 


We extend this function to [—2,2] as an odd function, so we obtain the same sine function, 
foaa(x) = f(a) and foaa(—x) =—f(x), where 2 € (0,2). 


The Fourier expansion of foaq on [—2, 2] is a sine series 


foaa(x) = 3 bn sin(“*). 
n=1 


The coefficients b, are given by 
4/3 


— i fle)sin(™Z) de =f 5sin( $2) ae = 2 con(*t) 


So we get 
bn = “ (cos(*2) cos(“*)). 


Since foaa(x) = f(x) for x € [0,2] we get that c, = by. So, the solution of the initial- 
boundary value problem for the heat equation contains is 


1001 2n ne y2 
u(t, v) = = - — (cos (= ) cos( —)) oat sin(“*). 
n=1 


J 


7.3.3. The IBVP: Neumann Conditions. We now find solutions of the one-space 
dimensional heat equation that satisfy a particular type of boundary conditions, called 
Neumann boundary conditions. These conditions fix the values of the heat flux at two sides 
of the bar. 

Theorem 7.3.3. The boundary value problem for the one space dimensional heat equation, 


Qu=kd2u, BC: Od,u(t,0)=0, Opu(t,L) =0, 


where k > 0, L > 0 are constants, has infinitely many solutions 
ie NTL 
u(t, x) = 3 + den male a *cos(“**), Cn ER. 


Furthermore, for every continuous function f on [0,L] satisfying f’(0) = f’(L) = 0, there 
is a unique solution u of the boundary value problem above that also satisfies the initial 
condition 


ai(O,ce) = fla) 


This solution u is given by the expression above, where the coefficients cy, are 


2 L 
n= Ff feyeos(™*) de, m= 0,12, 
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Remarks: This is an Initial-Boundary Value Problem (IBVP). The boundary conditions 
are called Neumann boundary conditions. The physical meaning of the initial-boundary 
conditions is simple. 


(a) The boundary conditions keep the heat flux (proportional to 0,u) at the sides of the 
bar is constant. 
(b) The initial condition is the initial temperature on the whole bar. 


One can use Dirichlet conditions on one side and Neumann on the other side. This is 
called a mixed boundary condition. The proof, in all cases, is based on the separation of 
variables method. 


Proof of the Theorem 7.3.3: Look for simple solutions of the heat equation given by 
u(t, x) = v(t) w(a). 


So we look for solutions having the variables separated into two functions. Introduce this 
particular function in the heat equation, 

1 o(t 

k v(t) w(x)’ 

where we used the notation t = dv/dt and w’ = dw/dx. The separation of variables in the 
function u implies a separation of variables in the heat equation. The left hand side in the 
last equation above depends only on ¢ and the right hand side depends only on x. The only 
possible solution is that both sides are equal the same constant, call it —A. So we end up 
with two equations 


) _ w"(2) 


OE) wie) = kee) we) => 


1 < i 
1H) ng MO 
k v(t) w(x) 
The equation on the left is first order and simple to solve. The solution depends on 4, 
u(t) = cy et, Cy = vy(0). 


The second equation leads to an eigenfunction problem for w once boundary conditions are 
provided. These boundary conditions come from the heat equation boundary conditions, 


0,u(t, 0) = v(t) w'(0) =0 for allt > 0 
; w’ (0) = w'(L) = 0. 
0,u(t, L) = v(t) w'(L) =0 for allt > 0 
So we need to solve the following BVP for w; 
w’+rAw =0, w' (0) = w'(L) = 0. 


This is an eigenfunction problem, which has solutions only for A > 0, because in that case 
the asociated characteristic polynomial has complex roots. If we write \ = py, for pp > 0, 
we get the general solution 


w(x) = c, cos(ua) + c) sin(ux). 
The boundary conditions apply on the derivative, 
w' (x) = —pc, sin(a) + pc, cos(px). 


The boundary conditions are 


0 = w'(0) = pee C = 0. 
So the function is w(x) = uc; cos(yx). The second boundary condition is 


0=w'(L)=—pe,sin(ulL) => sin(ulL)=0 3S p,L=nz, nm=1,2,---. 
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So we get the eigenvalues and eigenfunctions 


2 
m=(F), wn (2) = cos(“*), n=1,2,--- 


Since we now know the values of A, we introduce them in vy, 
Un(t) = Cn eo MEY t, 
Therefore, we got a simple solution of the heat equation BVP, 


Un (t, 2) = Cy e BE)"* cos("**) ; 


where n = 1,2,---. Since the boundary conditions for u, are homogeneous, then any linear 
combination of the solutions u, is also a solution of the heat equation with homogenous 
boundary conditions. Hence the function 


is solution of the heat equation with homogeneous Neumann boundary conditions. Notice 
that the constant solution c)/2 is a trivial solution of the Neumann boundary value problem, 
which was not present in the Dirichlet boundary value problem. Here the c, are arbitrary 
constants. Notice that at t = 0 we have 


u(0, 2) = 5 + ~ Cn cos“) 
n=1 


If we prescribe the c,, we get a solution u that at t = 0 is given by the previous formula. Is 
it the converse true? The answer is “yes”. Given f(x) = u(0,x), where f’(0) = f’(L) = 0, 
we can find all the coefficients c,. Here is how: Given f on [0, Z], extend it to the domain 
[—L, L] as an even function, 


feven(Z) = f (2) and feven(—2) = F(z), LE (0, L| 


We get that feven is continuous on [—L,L]. So feven has a Fourier series expansion. Since 
feven is even, the Fourier series is a cosine series 


os ao ie NTL 
feven(£) = 5) + Dian cos(“F ) 


and the coefficients are given by the formula 


a ig NTL Va hag NTL 
an = an Jovan 2) cos(“**) dx = al F(z) cos(“Z*) dz, = 0,1,2,-- _ 


Since foven(z) = f(x) for x € [0, L], then c, = a,. This establishes the Theorem. 


Example 7.3.2. Find the solution to the initial-boundary value problem 
Ou = O02u, t>0, xe [0,3], 


with initial and boundary conditions given by 


3 
7 ve[5,3), u'(t,0) =0, 

IC: u(0,2) = 3 BC: 
0 ze (0,5), 3) = 0. 
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Solution: We look for simple solutions of the form u(t, 2) = v 


(t) w(x), 
ae ato v(t) (0) — “ i . 
So, the equations for v and w are 


o(t) = —Av(t), w(x) +Aw(z) =0. 
The solution for v depends on 4, and is given by 


v(t) =c,e*, Cy = v)(0). 
Next we turn to the the equation for w, and we solve the BVP 
w'(z)+Aw(rz)=0, withBC w’'(0) =w’'(3) =0. 

This is an eigenfunction problem for w and ». This problem has solution only for A > 0, 
since only in that case the characteristic polynomial has complex roots. Let \ = p?, then 

p(r) =r? +p? =0 re = tpi. 
The general solution of the differential equation is 


Wn(a) = c, cos(ux) + c sin(ux). 
Its derivative is 


w' (x) = —pe, sin(ux) + wc, cos(px). 
The first boundary conditions on w implies 


0=w' (0N)=peQ, > G©=0 3S w(x) = c, cos(uz). 
The second boundary condition on w implies 
0=w'(3) = —pe, sin(u3), ¢c,40, = — sin(w3) =0. 


-_ Choosing c, = 1, we conclude, 
2 
An = (=) ; W(x) = cos(“* 


; ) ean eee 
Using the values of A, found above in the formula for v, we get 


Then, Un3 = nz, that is, uy 


Un(t) = tne Et, 


Cn = Un (0). 
Therefore, we get 


uli, 2) = : + +o et cos(“ =), 
n=1 


where we have added the trivial constant solution written as co/2. The initial condition is 


7 “£e€ ee 
(x) = u(0,2) = oe 
0 «ze (0, a) 
We extend f to [—3,3] as an even function 
3 
7 =30 
xe [5,3], 
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Since frmeven is even, its Fourier expansion is a cosine series 
co 
Ao NTx 
feven(Z) = — + ) Gn, cos{ —— ). 
2 3 
yf 


The coefficient a) is given by 


2.3 . (¢nmzy}3 
= ~7— sin(“**) | 
nT 3 3/2 


7 eo (0 sin(F)) 


2. 
=-—7 oe sin(n7). 


But for n = 2k we have that sin(2k7/2) = sin(ka) = 0, while for n = 2k — 1 we have that 
sin((2k — 1)7/2) = (—1)*-1. Therefore 

2(—1)* 
(2k — 1)r 
We then obtain the Fourier series expansion of feyen, 


Ty 2D gg ( 2 = Dre 
feven(£) _ 2 + ite: = 1)r cos( 3 ) 


aor = 0, a2p—-1 = 7 pe Ly dyes 


But the function f has exactly the same Fourier expansion on [0,3], which means that 


2(-1)' 
Sak—1) = Toe aR 


So the solution of the initial-boundary value problem for the heat equation is 


7 O°. -2(-1)¥ 9 ee-yry2 (2k — 1)rx 
t,c)=s+7) ——_e“' ve a ess 
u(t, x) 5 + 2. (Qk — 1m e 3 cos( ) 


Co = 7, cor = 0, 


3 


Example 7.3.3. Find the solution to the initial-boundary value problem 
Ou=402u, t>0, x € (0,2), 
with initial and boundary conditions given by 


u(0,2) = 3sin(7a/2), u(t,0)=0, u(t,2) =0. 


Solution: We look for simple solutions 


u(t, x) = v(t) w(a). 
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We put this function into the heat equation and we get 
o(t) _ w(x) 


w(x) d(t) =40(t)w" (2) => ie. aes ie 


The equations for v and w are 
v(t) = —4A v(t), w' (rz) +Aw(z) = 0. 


We solve for v, which depends on the constant A, and we get 
v(t) =c,e™, 
where c, = v)(0). Next we turn to the boundary value problem for w. We need to find the 
solution of 
w" (2) +Auw(x2) =0, with w(0) = w(2)=0. 

This is an eigenfunction problem for w and A. From § 7.1 we know that this problem has 
solutions only for A > 0, which is when the characteristic polynomial of the equation for 
w has complex roots. So we write \ = p? for > 0. The characteristic polynomial of the 
differential equation for w is 


pir) =r? +? =0 re = pi. 


The general solution of the differential equation is 
w(x) = & cos(uxr) + & sin(ux). 


The first boundary conditions on w implies 


0=w(0) =, w(x) =  sin(ux). 


The second boundary condition on w implies 
0 = w(2) = sin(u2), & 40, => sin(u2)=0. 


Then, 1,2 = nz, that is, u, = n7/2, for n > 1. Choosing ¢, = 1, we conclude, 


2 
Am = (+) F wn(x) = sin(“2=), n=1,2,--- 


Using these .,, in the expression for v, we get 
Un (t) = Cp e407)" 
The expressions for v, and w,, imply that the simple solution solution u,, has the form 


ee etn) i (=). 
Un(t, x) ae e€ sin( > 


n=1 


Since any linear combination of the function above is also a solution, we get 
[oe) 
= 24. (NTx 
u(t, x) = ‘> net sin(“*). 
n=1 
The initial condition is 
Co 
_ (TX (nme 
3 sin(™*) a de sin(“*). 


We now consider this function on the interval [—2, 2], where is an odd function. Then, the 
orthogonality of these sine functions above implies 


2 [/s0(%) sin) ae = Seu f sin(S*) sin) ae 
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We now study the different cases: 
Cm =0 for mA. 
Therefore we get, 


3sin(—) = sin(=) > ag=3. 
So the solution of the initial-boundary value problem for the heat equation is 


TL 


u(t, 2) = acer sin(=). 
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7.3.4. Exercises. 


7.3.1.- . 7.3.2.- . 


CHAPTER 8 


Review of Linear Algebra 


We review a few concepts of linear algebra, such as the Gauss operations to solve linear 
systems of algebraic equations, matrix operations, determinants, inverse matrix formulas, 
eigenvalues and eigenvectors of a matrix, diagonalizable matrices, and the exponential of a 
matrix. 


341 
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8.1. Linear Algebraic Systems 


This Section is a review of concepts from Linear Algebra needed to solve systems of linear 
differential equations. We start introducing an algebraic linear system, we then introduce 
matrices, column vectors, matrix-vector products, Gauss elimination operations, matrix 
echelon forms, and linear independence of vector sets. 


8.1.1. Systems of Linear Equations. The collection of results we call Linear Alge- 
bra originated with the study of linear systems of algebraic equations. 


Definition 8.1.1. An n x n system of linear algebraic equations is the following: 


Given constants aj; and b;, where i,j =1--- ,n > 1, find the constants x; solutions of 
411%, + +++ + Aintn = bi, (8.1.1) 
Qnif1 tess + Anntn = bn. (8.1.2) 
The system is called homogeneous iff all sources vanish, that is, b} =--- = b, = 0. 


Example 8.1.1. 


(a) A 2x 2 linear system on the unknowns 2, and 2, is the following: 


221 — &) = 0, 
—2%,+ 22, = 3. 


(b) A 3x 3 linear system on the unknowns 2,, x, and 2; is the following: 


v4 22> + v3 = 1, 
—32,+%,+ 30, = 24, 


Lp — 4X3 =—l. 4 


One way to find a solution to an n x n linear system is by substitution. Compute 2, 
from the first equation and introduce it into all the other equations. Then compute x, from 
this new second equation and introduce it into all the remaining equations. Repeat this 
procedure till the last equation, where one finally obtains x,. Then substitute back and 
find all the x;, for i = 1,--- ,n —1. A computational more efficient way to find a solution 
is to perform Gauss elimination operations on the augmented matrix of the system. Since 
matrix notation will simplify calculations, it is convenient we spend some time on this. We 
start with the basic definitions. 


Definition 8.1.2. Anim x n matriz, A, is an array of numbers 


Qt *** Alin 
mM rows, 
A = tJ ? 
n columns, 
Gm1 *"* amn 
where aij € C, fori = 1,---,m, j = 1,---,n, are the matrix coefficients. A square 


matriz is ann xn matriz, and the diagonal coefficients in a square matrix are ai. 


Example 8.1.2. 
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(a) Examples of 2 x 2, 2 x 3, 3x 2 real-valued matrices, and a 2 x 2 complex-valued matrix: 


12 
oi: 2 ft 2.3 _ _fiti 2-3 
A=(5 ‘li Balt sal: eas p=|"3 Pat 


(b) The coefficients of the algebraic linear systems in Example 8.1.1 can be grouped in 
matrices, as follows, 


%,+22,+2;=1, 1 9) 

2x41 — X= 0, Dm asi 
bs a=[ |. Betas +82 = 24, => A= |-3 1 3 
UT — 4%, = —1. 0 1 


Remark: A square matrix is upper (lower) triangular iff all the matrix coefficients below 
(above) the diagonal vanish. For example, the 3 x 3 matrix A below is upper triangular 
while B is lower triangular. 


i 23 100 
A=|0 4 5], Be=]2 3 0 
00 6 4 5 6 


The particular case of an m x 1 matrix is called an m-vector. 
U1 

Definition 8.1.3. An m-vector, v, is the array of numbers v= | : |, where the vector 
Um 


components are vy; © C, with i =1,--- ,m. 


Example 8.1.3. The unknowns of the algebraic linear systems in Example 8.1.1 can be 
grouped in vectors, as follows, 


%,+2%,+23;=1, 


241 —%2 = 0, r v1 

> t= F —3r,+ 2%, +323 = 24, => g= |x 

—%1 + 2% = 3, XL oe a . 
2 3 — 4. 


dq 
Definition 8.1.4. The matrix-vector product of ann xn matrix A and an n-vector x 
is an n-vector given by 


Git *** Gin vy Q11%1 T+ + Aintn 
Ax= : , = 


anti “7 ann In Ani V1 oid Gintn 


The matrix-vector product of an n x n matrix with an n-vector is another n-vector. This 
product is useful to express linear systems of algebraic equations in terms of matrices and 
vectors. 


Example 8.1.4. Find the matrix-vector products for the matrices A and vectors x in 
Examples 8.1.2(b) and Example 8.1.3, respectively. 
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Solution: In the 2 x 2 case we get 
— | 2 1} J}ay} — | 201-2, 
ao-|4 a} |el-Feva] 


In the 3 x 3 case we get, 


1 2 1 D4 Ly + 2x. + Ly 
Az = —3 1 3 Lo) = —32; + Xo + 323 
0 1 —-4 V3 Ly — 4X3 


dq 


Example 8.1.5. Use the matrix-vector product to express the algebraic linear system be- 


low, 


224 — t= 0, 
—2, + 2x9 = 3. 


Solution: Introduce the coefficient matrix A, the unknown vector x, and the source vector 


b as follows, 
_ 2 —l — |e _ 0 


Since the matrix-vector product Az is given by 
= 2 —l Ty) 224 — v9 
em | 2] fal =[2tcza], 
then we conclude that 


2x21 — %2 = 0, 241 — x2 0 
= Az = b. 
—% + 2% = | = i + sa | | = i 


<q 


It is simple to see that the result found in the Example above can be generalized to every 
n xX n algebraic linear system. 


Theorem 8.1.5 (Matrix Notation). The system in Eqs. (8.1.1)-(8.1.2) can be written as 


Ag = 6: 
where the coefficient matrix A, the unknown vector x, and the source vector b are 
G11 *** Gin vy by 
A= | 2 Sig atte , v= 
anti wee ann In bn 


Proof of Theorem 8.1.5: From the definition of the matrix-vector product we have that 
@11 *** Gin x4 Q11%,4 +++ + A1ntn 


ani = ann In Ani Li +++ + A1ntn 


Then, we conclude that 


Q11%, +++ + Gintn = by, Qt, +o + Qind bi 
nen 


= Axr=b. 


t 
I 


An1X1 t:++ + anntn = bn, Ani®, +++ + Ann bn 
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We introduce one last definition, which will be helpful in the next subsection. 
Definition 8.1.6. The augmented matrix of Ax = b is then x (n+ 1) matrix [A|b). 


The augmented matrix of an algebraic linear system contains the equation coefficients and 
the sources. Therefore, the augmented matrix of a linear system contains the complete 
information about the system. 


Example 8.1.6. Find the augmented matrix of both the linear systems in Example 8.1.1. 


Solution: The coefficient matrix and source vector of the first system imply that 


2 —-1 0 2 —-1 0 
4-|2 a) ef] = wo-[4 o | of 
The coefficient matrix and source vector of the second system imply that 
I 2 4 1 i 2 gi il 
A= ]-3 1 3], b= | 24 => {AljbJl)=}]-3 1 3 24 
0 1 -4 -1 0 1 -4 -1 


dq 


Recall that the linear combination of two vectors is defined component-wise, that is, given 
any numbers a,b € R and any vectors a, y, their linear combination is the vector given by 


ax, + by Ly Yt 
ax+ by = : , where =]: |], y=]: 
ALn, + bYn In Yn 
With this definition of linear combination of vectors it is simple to see that the matrix-vector 
product is a linear operation. 
Theorem 8.1.7 (Linearity). The matrix-vector product is a linear operation, that is, given 


ann xn matriz A, then for all n-vectors x, y and all numbers a,b € R holds 


A(ax+ by) =aAu+bAy. (8.1.3) 


Proof of Theorem 8.1.7: Just write down the matrix-vector product in components, 
a1 +: Gin] far,+by, ay (ax, + by.) + +++ + Gin(ary + byn) 
A(ax+by)= | : : = 
Gm ‘** Amn} |A%y + byn Ani (ax, + by,) + +++ + Ann (arn + bYn) 


Expand the linear combinations on each component on the far right-hand side above and 
re-order terms as follows, 


a (@11%, + +++ + @intn) +O (airy: +++ + GinYn) 
A(ax + by) = : 


a (Qn1X1 ie Gnntn.) ely b (Any nies eat AnnYn) 
Separate the right-hand side above, 


(a2; we diyiLr) (a1144 es AinYn) 
A(ax + by) =a ; +b 


(Qn1%1 ye Onn) (Qn1Y1 a AnnYn) 
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We then conclude that 
A(ax + by) =aAa+ bAy. 


This establishes the Theorem. 


8.1.2. Gauss Elimination Operations. We review three operations on an augmented 
matrix of a linear system. These operations change the augmented matrix of the system but 
they do not change the solutions of the system. The Gauss elimination operations were 
already known in China around 200 BC. We call them after Carl Friedrich Gauss, since he 
made them very popular around 1810, when he used them to study the orbit of the asteroid 
Pallas, giving a systematic method to solve a 6 x 6 algebraic linear system. 


Definition 8.1.8. The Gauss elimination operations are three operations on a matrix: 


(i) Adding to one row a multiple of the another; 
(it) Interchanging two rows; 
(itt) Multiplying a row by a non-zero number. 


These operations are respectively represented by the symbols given in Fig. 1. 


(i) ] *) (ii) a ) (iii) a —aF0 


FIGURE 1. A sketch of the Gauss elimination operations. 


As we said above, the Gauss elimination operations change the coefficients of the augmented 
matrix of a system but do not change its solution. Two systems of linear equations having 
the same solutions are called equivalent. It can be shown that there is an algorithm using 
these operations that transforms any n x n linear system into an equivalent system where 
the solutions are explicitly given. 


Example 8.1.7. Find the solution to the 2 x 2 linear system given in Example 8.1.1 using 
the Gauss elimination operations. 


Solution: Consider the augmented matrix of the 2 x 2 linear system in Example (8.1.1), 
and perform the following Gauss elimination operations, 


2 -1 | 0] ,[2 -1 | 0] _J2 -1 | 0 2-1 | Oo], 
-1 2 3 a 6 @ 3 6}? lo 1 2 


Oa OG ae | oh OF Agee aes 


Example 8.1.8. Find the solution to the 3 x 3 linear system given in Example 8.1.1 using 
the Gauss elimination operations 


J 


Solution: Consider the augmented matrix of the 3 x 3 linear system in Example 8.1.1 and 
perform the following Gauss elimination operations, 


1 2 1 1 12 1 1 12 1 1 
-3 1 3 24; > |0 7 6 27; > |0 1 —4 ae 
O 1 —-4 —1 0 1 -4 -1 0 7 6 27 
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10 9 3 10 9 3 io: ee | é t= —6, 
®t et: |) eal le a et. | tl age ad a 3| 3s 2 =3, 
0 0 34 34 00 1 1 001 1 sion 

<dJ 


In the last augmented matrix on both Examples 8.1.7 and 8.1.8 the solution is given ex- 
plicitly. This is not always the case with every augmented matrix. A precise way to define 
the final augmented matrix in the Gauss elimination method is captured in the notion of 
echelon form and reduced echelon form of a matrix. 


Definition 8.1.9. An mx n matrix is in echelon form iff the following conditions hold: 


(i) The zero rows are located at the bottom rows of the matria; 
(it) The first non-zero coefficient on a row is always to the right of the first non-zero 
coefficient of the row above it. 


The pivot coefficient is the first non-zero coefficient on every non-zero row in a matrix in 


echelon form. 


Example 8.1.9. The 6 x 8, 3x5 and 3 x 3 matrices given below are in echelon form, where 
the * means any non-zero number and pivots are highlighted. 


kK OK ok Ok OK Ok OK 
O O * * x * * x fe es a 2 
a ee ee 0 O * x x O * x 
000 00 0 « x}? 00000 ’ (0 # 
000 00 0 0 0 
000 00 0 0 0 z 
Example 8.1.10. The following matrices are in echelon form, with pivot highlighted. 
; i] P 3 ee 
0 1 0 4 -2 0 0 0 
dq 


Definition 8.1.10. An m x n matrix is in reduced echelon form iff the matrix is in 
echelon form and the following two conditions hold: 


(i) The pivot coefficient is equal to 1; 
it) The pivot coefficient is the only non-zero coefficient in that column. 
y 


We denote by E4 a reduced echelon form of a matrix A. 


Example 8.1.11. The 6 x 8, 3 x 5 and 3 x 3 matrices given below are in echelon form, 
where the * means any non-zero number and pivots are highlighted. 


0 O 0 


oCoCcoCoo Ff 
ooCoCcooeoo x 
SoCoCcOoO Ff 
oocrF oOo 
ooCocx* * * 
oo ao * * * 
oor oCo oO 
oox* * * * 
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Example 8.1.12. And the following matrices are not only in echelon form but also in 
reduced echelon form; again, pivot coefficients are highlighted. 


fe Boy. foro 
0 1 0 1 £5 00 0 
<q 


Summarizing, the Gauss elimination operations can transform any matrix into reduced 
echelon form. Once the augmented matrix of a linear system is written in reduced echelon 
form, it is not difficult to decide whether the system has solutions or not. 


Example 8.1.13. Use Gauss operations to find the solution of the linear system 


224 — = 0, 


ae 
pan 
Solution: We find the system augmented matrix and perform appropriate Gauss elimination 
operations, 
i 7 »| > [2 na | > [a rae 
1 1 1 
5. u 3 2 1 -1 0 O 1 


From the last augmented matrix above we see that the original linear system has the same 
solutions as the linear system given by 


221 = 09 = 0, 
= 1, 
Since the latter system has no solutions, the original system has no solutions. <J 
The situation shown in Example 8.1.13 is true in general. If the augmented matrix [A|}] of 
an algebraic linear system is transformed by Gauss operations into the augmented matrix 


[Alb] having a row of the form [0,--- , 0/1], then the original algebraic linear system Aa = b 
has no solution. 


Example 8.1.14. Find all vectors 6 such that the system Ax = b has solutions, where 


i <3. 3 by 
A=|-1 1 -21, b= |b 
a <i 4 bs 


Solution: We do not need to write down the algebraic linear system, we only need its 
augmented matrix, 


1 -2 3 by 1 -—2 3 by 

[A] b] = }|-l 1 —2 bo} > 10 —-1 1 bi + be| 3 
2 <1 3 bs 2 -1 3 bs 

1 -2 3 by 1 -2 3 by 

0 1 1 by bg > 10 1 —1 —b; = bg 

0 3 -8 bs — 2b; 0 O O bg + by + 3b2 
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Therefore, the linear system Ax = b has solu- 

tions <> the source vector satisfies the equa- bs 

tion holds b; + 3b2 + b3 = 0. n 
That is, every source vector 6 that lie on the 

plane normal to the vector n is a source vec- 

tor such that the linear system Ax = b has 

solution, where 


by 
<J 


8.1.3. Linearly Dependence. We generalize the idea of two vectors lying on the 
same line, and three vectors lying on the same plane, to an arbitrary number of vectors. 


Definition 8.1.11. A set of vectors {v,,--- , u,}, with k > 1 is called linearly dependent 
iff there exists constants c,,--- ,cp, with at least one of them non-zero, such that 

Cy Vy +++ + Cy VE = O. (8.1.4) 
The set of vectors is called linearly independent iff it is not linearly dependent, that is, 


the only constants c,,-+- , cx that satisfy Eq. (8.1.4) are given by c, =--: = cp =0. 


In other words, a set of vectors is linearly dependent iff one of the vectors is a linear combi- 
nation of the other vectors. When this is not possible, the set is called linearly independent. 


Example 8.1.15. Show that the following set of vectors is linearly dependent, 


1 3 —1 
{]2}. |2}.] 2] 4, 
3 1 5 


and express one of the vectors as a linear combination of the other two. 


Solution: We need to find constants c,, c., and c3 solutions of the equation 


1 3 -1 0 1 3 -1] Ja 0 
2) a+ 42) ot |] 2} g= 10 & 22 2 Co} + |0 
3 1 5 0 3 1 5 C3 0 
The solution to this linear system can be obtained with Gauss elimination operations, 

a | a ee Le -] 10 2 C, = — 2s, 

22 2);4)/0 -4 4}/>4]/0 1 -1)/ 5 ]0 1 -1 => Co = Cs, 

3 1 5 0 -8 8 0 1 -1 0 0 O C3 = free. 
Since there are non-zero constants C;, C2, C3 solutions to the linear system above, the vectors 
are linearly dependent. Choosing c; = —1 we obtain the third vector as a linear combination 
of the other two vectors, 

il 3 -1 0 —1 il 3 
2/2) —}2} —] 2] = JO oS 2] =2 |2]| —}2 
3 1 5 0 5 3 1 
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8.2. Matrix Algebra 


The matrix-vector product introduced in Section 8.1 implies that an n x n matrix A is a 
function A: R" —+ R”. This idea leads to introduce matrix operations, like the operations 
introduced for functions f : R — R. These operations include linear combinations of 
matrices; the composition of matrices, also called matrix multiplication; the inverse of a 
square matrix; and the determinant of a square matrix. These operations will be needed in 
later Sections when we study systems of differential equations. 


8.2.1. A Matrix is a Function. The matrix-vector product leads to the interpreta- 
tion that an n x n matrix A is a function. If we denote by R” the space of all n-vectors, we 
see that the matrix-vector product associates to the n-vector x the unique n-vector y= Aa. 
Therefore the matrix A determines a function A: R” > R”. 


Example 8.2.1. Describe the action on R? of the function given by the 2 x 2 matrix 


A A (8.2.1) 


Solution: The action of this matrix on an arbitrary element # € R? is given below, 


tet ol Elle] 


Therefore, this matrix interchanges the components x, and x, of the vector x. It can be 
seen in the first picture in Fig. 2 that this action can be interpreted as a reflection on the 
plane along the line x, = 2. <J 


FIGURE 2. Geometrical meaning of the function determined by the matrix 
in Eq. (8.2.1) and the matrix in Eq. (8.2.2), respectively. 


Example 8.2.2. Describe the action on R? of the function given by the 2 x 2 matrix 


A= ft | (8.2.2) 


Solution: The action of this matrix on an arbitrary element 2 € R? is given below, 


of Jel-Cel 
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In order to understand the action of this matrix, we give the following particular cases: 


0 —1} }1} _ |0 O —1} |1}_ |-1 0 -l1} |-1} | 0 
1 Of} JO] fly? 1 Of; fl) J Ty? 1 0 Oo}; J-l1}- 
These cases are plotted in the second figure on Fig. 2, and the vectors are called x, y and 


z, respectively. We therefore conclude that this matrix produces a ninety degree counter- 
clockwise rotation of the plane. <J 


An example of a scalar-valued function is f : R > R. We have seen here that an n xn matrix 
A is a function A: R” > R”. Therefore, one can think an n x n matrix as a generalization 
of the concept of function from R to R”, for any positive integer n. It is well-known how to 
define several operations on scalar-valued functions, like linear combinations, compositions, 
and the inverse function. Therefore, it is reasonable to ask if these operation on scalar- 
valued functions can be generalized as well to matrices. The answer is yes, and the study 
of these and other operations is the subject of the rest of this Section. 


8.2.2. Matrix Operations. The linear combination of matrices refers to the addition 
of two matrices and the multiplication of a matrix by scalar. Linear combinations of matrices 
are defined component by component. For this reason we introduce the component notation 
for matrices and vectors. We denote an m x n matrix by A = [Ajj], where A;; are the 
components of matrix A, with 7 = 1,---,m and j = 1,--- ,n. Analogously, an n-vector is 
denoted by v = [v,;], where v; are the components of the vector v. We also introduce the 
notation F = {R, C}, that is, the set F can be the real numbers or the complex numbers. 


Definition 8.2.1. Let A = [A;,;] and B = [Bij] be m x n matrices in F™” and a, b be 
numbers in F. The linear combination of A and B is also and m x n matriz in F™”, 
denoted asaA+bB, and given by 

The particular case where a = 6 = 1 corresponds to the addition of two matrices, and the 


particular case of b = 0 corresponds to the multiplication of a matrix by a number, that is, 


A+B=[Aj; + Bi], aA = [a Ajy). 


Example 8.2.3. Find the A+ B, where A = F ik B= E il 


Solution: The addition of two equal size matrices is performed component-wise: 


sepals il+[e a)=[or5 Gaulle a 


dq 
: 1 2 1 2 3 
Example 8.2.4. Find the A+ B, where A = F ‘| B= F 5 HF 
Solution: The matrices have different sizes, so their addition is not defined. <J 


Example 8.2.5. Find 2A and A/3, where A = > il: 


2 4 6 
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Solution: The multiplication of a matrix by a number is done component-wise, therefore 


1 
Lg 2 
p49 (lt 3-5] [2 & 10 A isi 3 5] _|3 3 
~“l2 4 6) [4 8 12)? 3. 3(|2 4 6] lo 4 
- 2 
_ < 


Since matrices are generalizations of scalar-valued functions, one can define operations 
on matrices that, unlike linear combinations, have no analogs on scalar-valued functions. 
One of such operations is the transpose of a matrix, which is a new matrix with the rows 
and columns interchanged. 


Definition 8.2.2. The transpose of a matrix A = [A;;| € F”” is the matrix denoted as 
AT = [(A™),i| € F"", with its components given by (A’),, =A. 


Example 8.2.6. Find the transpose of the 2 x 3 matrix A = E 2 | . 


2 4 6 


Solution: Matrix A has components Ajj with ¢ = 1,2 and j = 1,2,3. Therefore, its 
transpose has components (A‘),; = Aj;;, that is, AT has three rows and two columns, 


i 2 
A =|\3 4 
5 6 


If a matrix has complex-valued coefficients, then the conjugate of a matrix can be defined 
as the conjugate of each component. 


Definition 8.2.3. The complex conjugate of a matriz A = [Aj;] €¢ F™” is the matrix 
A= [ Aij] € Fm", 


Example 8.2.7. A matrix A and its conjugate is given below, 
fa O44 > it Beas 
a=|7, ale = A=|i ale 


Example 8.2.8. A matrix A has real coefficients iff A = A; It has purely imaginary 


dq 


coefficients iff A = —A. Here are examples of these two situations: 
1 2 = i 2 
A=(; Hl s A=(5 “|= 4 
a 2 = 1 —2i 
a= b il = a ee a ae 


Definition 8.2.4. The adjoint of a matrix A € F™” is the matrix A* = A’ eEF™™, 


Since (A)" = (A‘), the order of the operations does not change the result, that is why there 
is no parenthesis in the definition of A*. 


354 8. REVIEW OF LINEAR ALGEBRA 


Example 8.2.9. A matrix A and its adjoint is given below, 
1 By ein i 
fea |? = ah = 2 ap mal 


The transpose, conjugate and adjoint operations are useful to specify certain classes of 
matrices with particular symmetries. Here we introduce few of these classes. 


J 


Definition 8.2.5. Ann xn matrix A is called: 
(a) symmetric iff holds A = A’; 

(b) skew-symmetric iff holds A = —A’; 

(c) Hermitian iff holds A = A’*; 

(d) skew-Hermitian iff holds A = —A*. 


Example 8.2.10. We present examples of each of the classes introduced in Def. 8.2.5. 
Part (a): Matrices A and B are symmetric. Notice that A is also Hermitian, while B is 
not Hermitian, 


1 2 3 1 2+37 3 
A=|2 7 4|=AT, B=|24+3i 7 4i/ =BT. 
3.4 8 3 Ai 8 
Part (b): Matrix C is skew-symmetric, 
0 -2 3 0 2 —-3 
C=] 2 0 —4 = CT=|-2 0 4| =-C. 
—-3 #4 0 3-4 O 
Notice that the diagonal elements in a skew-symmetric matrix must vanish, since Cy; = —C}; 


in the case 1 = 7 means Cj; = —Cj;, that is, Ci; = 0. 
Part (c): Matrix D is Hermitian but is not symmetric: 


1 2+% 3 1 2-1 3 
D=|2-% 7 A+i > DP=|2+i 7 4-i| 4D, 
3. 4-i 8 3. 44% 8 
however, 
7 1 2+4 3 
D* =D = |2-i 7 4+%; =D 
3. 4-1 8 


Notice that the diagonal elements in a Hermitian matrix must be real numbers, since the 

condition Aij = Aji in the case i = J implies Ay = Au, that is, 2iIm(A;;) = Ai _ Ax = 0. 

We can also verify what we said in part (a), matrix A is Hermitian since A* = A’ =ATHA. 
Part (d): The following matrix E is skew-Hermitian: 


i 2+% —3 a —2+%4 3 
E=|-2+i Ti 4+i} => ET = ]2+i Ti —44i 
3 —-4+% 8 —3 4A+% 81 
therefore, 
ea oe —2-1 3 
EX =EB |2-i 7 4-i| =-E 


A skew-Hermitian matrix has purely imaginary elements in its diagonal, and the off diagonal 
elements have skew-symmetric real parts with symmetric imaginary parts. <J 
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The trace of a square matrix is a number, the sum of all the diagonal elements of the matrix. 


Definition 8.2.6. The trace of a square matrix A = [Ais] € F™”, denoted as tr (A) € F, 
is the sum of its diagonal elements, that is, the scalar given by tr (A) = Ai; +--:+ Ann. 


1 2 8 
Example 8.2.11. Find the trace of the matrix A= |4 5 6]. 
7 8 9 


Solution: We only have to add up the diagonal elements: 
tr(A)=14+54+9 = tr(A)=15. 
dq 


The operation of matrix multiplication originates in the composition of functions. We 
call it matrix multiplication instead of matrix composition because it reduces to the mul- 
tiplication of real numbers in the case of 1 x 1 real matrices. Unlike the multiplication of 
real numbers, the product of general matrices is not commutative, that is, AB 4 BA in 
the general case. This property reflects the fact that the composition of two functions is a 
non-commutative operation. 


Definition 8.2.7. The matrix multiplication of the m x n matrit A = [Ajj] and the 
nx l matric B = [Bjx], wherei = 1,---,m, j =1,---,n andk =1,--- ,é, is the mx ¢ 
matric AB given by 
(AB)ix = > Agha: (8.2.3) 
j=l 


The product is not defined for two arbitrary matrices, since the size of the matrices is 
important: The numbers of columns in the first matrix must match the numbers of rows in 


the second matrix. 
A times B defines AB 


mxn nx mx é 


Example 8.2.12. Compute AB, where A = E ee and B= b E|) 


Solution: The component (AB); = 4 is obtained from the first row in matrix A and the 
first column in matrix B as follows: 


a ale SJ-[f 3). @@+cn@=4 


The component (AB);2 = —1 is obtained as follows: 


2 le S|-{f 3), @@+cCna--: 


The component (AB)2; = 1 is obtained as follows: 


E Fy E Mi 5h: (-1)(8) + 2)(2) = 1; 


And finally the component (AB):2 = —2 is obtained as follows: 


ao alb Sl-( a]. ca@+@cn- 
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Example 8.2.13. Compute BA, where A = E | and B= E ‘I 


6 


Solution: We find that BA = E 


a . Notice that in this case AB 4 BA. <J 


Example 8.2.14. Compute AB and BA, where A = E and B = h : HF 


Solution: The product AB is 


4 3) j1 2 3 16 23 30 
AB=|) i r 5 = AB=|' 9 a 
The product BA is not possible. <J 


Example 8.2.15. Compute AB and BA, where A = Fi 4 and B= EF El; 


it 2b a i ad 
aB=|; se mle ai 
=f. 4) fi 2 0 0 
Ba=|y Ali lel ie 
Remarks: 


(a) Notice that in this case AB 4 BA. 
(b) Notice that BA = 0 but A#0 and B 0. 


Solution: We find that 


J 


8.2.3. The Inverse Matrix. We now introduce the concept of the inverse of a square 
matrix. Not every square matrix is invertible. The inverse of a matrix is useful to compute 
solutions to linear systems of algebraic equations. 


Definition 8.2.8. The matria I, € F™” is the identity matriz iff [,«2 = x for alla € F”. 


It is simple to see that the components of the identity matrix are given by 


Definition 8.2.9. A matriz A € F”” is called invertible iff there exists a matrix, denoted 
as A~', such that (A7")A =TI,, and A(A™*) — ew 


Example 8.2.16. Verify that the matrix and its inverse are given by 


2 2 4 ifs 
a=[} Ar = =aL4 si 
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Solution: We have to compute the products, 
a 2 1p es 2) 14-0 ae 
aay = |r [4 2/=a\o 4) — A= 
It is simple to check that the equation (A~') A = I. also holds. <J 


Theorem 8.2.10. Given a2 x 2 matrix A introduce the number A as follows, 


a=(° il, A = ad — be. 
ed 
The matriz A is invertible iff A £0. Furthermore, if A is invertible, its inverse is given by 
1{[d — 
= 
AU = re i | , (8.2.4) 


The number A is called the determinant of A, since it is the number that determines whether 
A is invertible or not. 


Example 8.2.17. Compute the inverse of matrix A = f 4 , given in Example 8.2.16. 


Solution: Following Theorem 8.2.10 we first compute A = 6 —4 = 4. Since A ¥ 0, then 
A7! exists and it is given by 
anal | 2 . 


4 | =1 2 
dq 
F : 1 2 
Example 8.2.18. Compute the inverse of matrix A = 3 6: 
Solution: Following Theorem 8.2.10 we first compute A = 6 —6 = 0. Since A = 0, then 
matrix A is not invertible. <J 


The matrix operations we have introduced are useful to solve matrix equations, where 
the unknown is a matrix. We now show an example of a matrix equation. 


Example 8.2.19. Find a matrix X such that AX B = I, where 
1 3 2 1 1 0 
4=fa} @=f a) t=f of 


Solution: There are many ways to solve a matrix equation. We choose to multiply the 
equation by the inverses of matrix A and B, if they exist. So first we check whether A is 
invertible. But 


=|) teases 20. 
2 1 
so A is indeed invertible. Regarding matrix B we get 
det(B) =|f 3] =4-1=340, 


so B is also invertible. We then compute their inverses, 


1 1 -8 1/2 -1 
-1_ of ea 
a == |3 rh " ak ap 
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We can now compute X, 


AXB=I = A \(AXB)B'=A™'IB*+ > X=A™"'B", 


Therefore, 
ye 1 1 -3;) 1/2 -1} 1 5. 
~ 5 |-2 1]/3 ]/-1 2] 2415 ]-5 4 
so we obtain 
1 if 
x=|3 5 
3 15 


J 


8.2.4. Computing the Inverse Matrix. Gauss operations can be used to compute 
the inverse of a matrix. The reason for this is simple to understand in the case of 2 x 2 
matrices, as can be seen in the following Example. 


Example 8.2.20. Given any 2 x 2 matrix A, find its inverse matrix, A~!, or show that the 
inverse does not exist. 


Solution: If the inverse matrix, A~! exists, then denote it as A~' = [a, a]. The equation 


A(A7!) = Ip is then equivalent to A[a, a] = | This equation is equivalent to 


1 0 
O ly 
solving two algebraic linear systems, 


sa=[], an=[] 


Here is where we can use Gauss elimination operations. We use them on both systems 


1 
0 


0 
il 


A A 


b 


However, we can solve both systems at the same time if we do Gauss operations on the 
bigger augmented matrix 

10 

O01] 


Now, perform Gauss operations until we obtain the reduced echelon form for [A|I2]. Then 
we can have two different types of results: 


A 


e If there is no line of the form [0,0|*,*] with any of the star coefficients non-zero, 
then matrix A is invertible and the solution vectors 21, #2 form the columns of the 
inverse matrix, that is, A~' = [ay, x9]. 

e If there is a line of the form [0,0|*, *] with any of the star coefficients non-zero, 
then matrix A is not invertible. <J 


2 
Example 8.2.21. Use Gauss operations to find the inverse of A = A ; ; 


Solution: As we said in the Example above, perform Gauss operation on the augmented 
matrix [A|J2] until the reduced echelon form is obtained, that is, 
0 a 
— 


22/10 13 |o1isf s 
i 3 | ¢@ a) ee | 4 o 0 <4 |4 3 
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fF 3 0 ‘ = k 0 | 3 e| 

1 1 1 1 
O 1 | -Z 3 Ol | -z7 3 
That is, matrix A is invertible and the inverse is 


1 2 8 
Example 8.2.22. Use Gauss operations to find the inverse of A= |2 5 7}. 
3.7 «9 


Solution: We perform Gauss operations on the augmented matrix [A|J3] until we obtain 
its reduced echelon form, that is, 


1 2 3 1 0 0 1 2 3 1 0 0 
25 7 01 0}>]/0 1 1 —2 1 0/ — 
3.7 «9 0 0 1 0 1 0 -3 0 1 
10 1 5 -2 0 101 5 -—2 0 
01 1 —2 1 OJ 3 j0 1 1 —2 1 0 
0 0 -1 -1 -1 1 00 1 1 1 -l 
1 01 5 -2 0O 1 0 0 4 -3 1 
011 -2 1 0/5 ]0 1 O -3 0 1 
00 1 1 1 -l 00 1 1 1 -l 
We conclude that matrix A is invertible and 
4 -3 1 
At*=|-3 0 1 
I 1 -l 


<q 


8.2.5. Overview of Determinants. A determinant is a scalar computed form a 
square matrix that gives important information about the matrix, for example if the matrix 


is invertible or not. We now review the definition and properties of the determinant of 2 x 2 
and 3 x 3 matrices. 


Definition 8.2.11. The determinant of a 2 x 2 matrix A= be ee 
21 G22 


| is given by 


Q41 412 
a21 422 


det(A) = 


= 411422 — 412421. 


441 412 413 
The determinant of a3 x 3 matrix A= |a21 a2 a3] is given by 


431 432 433 
411 412 413 
det(A) = ]a21 a22  a23| = a11 
431 432 433 


a22 423 
a32 433 


Example 8.2.23. The following three examples show that the determinant can be a neg- 
ative, zero or positive number. 

1 2 
3.4 


2 1 
3.4 


1 2 
2 4 


=4-6=-2, | [=8-3=5, | [=4-4=0 
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The following is an example shows how to compute the determinant of a 3 x 3 matrix, 


a a 11 21 21 
ei ahead 4 = 
5. 4 a) | i i i| + 5 
=(1-9).90 3) 4-3) 
=-1+3-1 
=a Z 


Remark: The determinant of upper or lower triangular matrices is the product of the 
diagonal coefficients. 


Example 8.2.24. Compute the determinant of a 3 x 3 upper triangular matrix. 


Solution: 


+ a13 


ie = 4114224 
0 0 33- 


<q 


The absolute value of the determinant of a 2 x 2 matrix A = [a,, a] has a geometrical 
meaning: It is the area of the parallelogram whose sides are given by a, and a, that is, by 
the columns of the matrix A; see Fig. 3. Analogously, the absolute value of the determinant 
of a 3 x 3 matrix A = [a,, ad, a3] also has a geometrical meaning: It is the volume of the 
parallelepiped whose sides are given by a, a and ag, that is, by the columns of the matrix 
A; see Fig. 3. 


Ton R?2 734 R3 


T% 2, 


FIGURE 3. Geometrical meaning of the determinant. 


The determinant of an n x n matrix A can be defined generalizing the properties that 
areas of parallelogram have in two dimensions and volumes of parallelepipeds have in three 
dimensions. One of these properties is the following: if one of the column vectors of the 
matrix A is a linear combination of the others, then the figure determined by these column 
vectors is not n-dimensional but (n—1)-dimensional, so its volume must vanish. We highlight 
this property of the determinant of n x n matrices in the following result. 


Theorem 8.2.12. The set of n-vectors {v,,--- , Un}, with n > 1, is linearly dependent iff 
det[v1,--+- , Un] = 0. 
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Example 8.2.25. Show whether the set of vectors below linearly independent, 


1 3 —3 
{ 2|, fal, | 2 \. 
3 1 7 
The determinant of the matrix whose column vectors are the vectors above is given by 
1 3 -3 
2 2 2) = (1) (14-2) —3(14-6) + (-3) (2—6) = 12-244 12=0. 
3 1 7 
Therefore, the set of vectors above is linearly dependent. <l 


The determinant of a square matrix also determines whether the matrix is invertible or not. 


Theorem 8.2.13. Ann xn matric A is invertible iff holds det(A) 4 0. 


1 2 8 
Example 8.2.26. Is matrix A= }|2 5 7] invertible? 
3.7 9 
Solution: We only need to compute the determinant of A. 
1 2 8 
det(AV=l2 5 7-2 J-@|? Vel? 9. 
37 «9 7 9 3.9 3°47 


Using the definition of determinant for 2 x 2 matrices we obtain 
det(A) = (45 — 49) — 2(18 — 21) + 3(14— 15) = —44 6-3. 


Since det(A) = —1, that is, non-zero, matrix A is invertible. <J 


362 


8.2.6. Exercises. 


8.2.1.- . 


8. REVIEW OF LINI 


EAR ALGEBRA 


8.2.2.- . 
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8.3. Eigenvalues and Eigenvectors 


We continue with the review on Linear Algebra we started in § 8.1 and § 8.2. We saw 
that a square matrix is a function on the space of vectors, since it acts on a vector and the 
result is another vector. In this section we see that, given an n x n matrix, there may exist 
lines through the origin in R” that are left invariant under the action of the matrix. This 
means that such a matrix acting on any vector in such a line is a vector on the same line. 
The vector is called an eigenvector of the matrix, and the proportionality factor is called 
an eigenvalue. If there is a linearly independent set of n eigenvectors of an n x n matrix, 
then this matrix is diagonalizable. In § 8.4 we will see that the exponential of a matrix is 
particularly simple to compute in the case that the matrix is diagonalizable. In Chapter 5 
we use the exponential of a matrix to write the solutions of systems of linear differential 
equations with constant coefficients. 


8.3.1. Eigenvalues and Eigenvectors. When a square matrix acts on a vector the 
result is another vector that, more often than not, points in a different direction from the 
original vector. However, there may exist vectors whose direction is not changed by the 
matrix. These will be important for us, so we give them a name. 


Definition 8.3.1. A number \ and a nonzero n-vector v are an eigenvalue with corre- 
sponding eigenvector (eigenpair) of ann x n matriz A iff they satisfy the equation 


Av= dv. 


Remark: We see that an eigenvector v determines a particular direction in the space R”, 
given by (av) for a € R, that remains invariant under the action of the matrix A. That 
is, the result of matrix A acting on any vector (av) on the line determined by v is again a 
vector on the same line, since 


A(av) = a Av=adv= X(av). 


Example 8.3.1. Verify that the pair \,, v; and the pair A,, v, are eigenvalue and eigen- 
vector pairs of matrix A given below, 


1 3 
A=|; i) -1 
Ap = —2 »=|q]- 


Solution: We just must verify the definition of eigenvalue and eigenvector given above. We 
start with the first pair, 


1 3] {1 4 1 
in [J B)-E]-sE as > anna 


A similar calculation for the second pair implies, 


1 3) |-1 2 —1 
Avo = F i | 4 = 3 =-2 | | = )oUo > Av, = \oU>. 


Example 8.3.2. Find the eigenvalues and eigenvectors of the matrix A = fi il : 
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Solution: This is the matrix given in Example 8.2.1. The action of this matrix on the plane 
is a reflection along the line x, = x2, as it was shown in Fig. 2. Therefore, this line x, = x, 
is left invariant under the action of this matrix. This property suggests that an eigenvector 
is any vector on that line, for example 


vf] > BAG-E > 5 


: : . : 1 
So, we have found one eigenvalue-eigenvector pair: A, = 1, with v, = | We remark that 


1 
any nonzero vector proportional to v, is also an eigenvector. Another choice fo eigenvalue- 


eigenvector pair is A, = 1, with v, = It is not so easy to find a second eigenvector 


3]: 
which does not belong to the line determined by v,. One way to find such eigenvector is 
noticing that the line perpendicular to the line x, = 2, is also left invariant by matrix A. 


Therefore, any nonzero vector on that line must be an eigenvector. For example the vector 


i, oleae Sie 
i i | = Ei ==) Fal eae ee 


fi 


: . F : -1 
So, we have found a second eigenvalue-eigenvector pair: A. = —1, with v, = | 1 i These 
two eigenvectors are displayed on Fig. 4. <J 
a) am) 
Lo = 
Ag 3 ; : Ag 
“Av = v. 
1. al 0 =e 
Uy 2 
x 
x 


FIGuRE 4. The first picture shows the eigenvalues and eigenvectors of the 
matrix in Example 8.3.2. The second picture shows that the matrix in 
Example 8.3.3 makes a counterclockwise rotation by an angle 0, which 
proves that this matrix does not have eigenvalues or eigenvectors. 


There exist matrices that do not have eigenvalues and eigenvectors, as it is show in the 
example below. 


Example 8.3.3. Fix any number 0 € (0, 27) and define the matrix A = bees 7 eH ' 


sin(?) — cos(@) 
Show that A has no real eigenvalues. 
Solution: One can compute the action of matrix A on several vectors and verify that the 


action of this matrix on the plane is a rotation counterclockwise by and angle 0, as shown in 
Fig. 4. A particular case of this matrix was shown in Example 8.2.2, where 6 = 7/2. Since 
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eigenvectors of a matrix determine directions which are left invariant by the action of the 
matrix, and a rotation does not have such directions, we conclude that the matrix A above 
does not have eigenvectors and so it does not have eigenvalues either. <J 


Remark: We will show that matrix A in Example 8.3.3 has complex-valued eigenvalues. 


We now describe a method to find eigenvalue-eigenvector pairs of a matrix, if they exit. 
In other words, we are going to solve the eigenvalue-eigenvector problem: Given an n x n 
matrix A find, if possible, all its eigenvalues and eigenvectors, that is, all pairs A and v 4 O 
solutions of the equation 


Av= Qu. 


This problem is more complicated than finding the solution x to a linear system Ax = b, 
where A and b are known. In the eigenvalue-eigenvector problem above neither A nor v are 
known. To solve the eigenvalue-eigenvector problem for a matrix A we proceed as follows: 


(a) First, find the eigenvalues A; 
(b) Second, for each eigenvalue , find the corresponding eigenvectors v. 


The following result summarizes a way to solve the steps above. 


Theorem 8.3.2 (Eigenvalues-Eigenvectors). 
(a) All the eigenvalues » of ann x n matrix A are the solutions of 
det(A — AZ) = 0. (8.3.1) 


(b) Given an eigenvalue X of ann x n matrix A, the corresponding eigenvectors v are the 
nonzero solutions to the homogeneous linear system 


(A—ADv=0. (8.3.2) 


Proof of Theorem 8.3.2: The number 4 and the nonzero vector v are an eigenvalue- 
eigenvector pair of matrix A iff holds 


Av=\v © (A-AI)v=0, 


where I is the n x n identity matrix. Since v 4 O, the last equation above says that the 
columns of the matrix (A — AL) are linearly dependent. This last property is equivalent, by 
Theorem 8.2.12, to the equation 

det(A — AT) = 0, 


which is the equation that determines the eigenvalues A. Once this equation is solved, 
substitute each solution \ back into the original eigenvalue-eigenvector equation 


(A-—AI)v= 0. 


Since A is known, this is a linear homogeneous system for the eigenvector components. It 
always has nonzero solutions, since A is precisely the number that makes the coefficient 
matrix (A — AJ) not invertible. This establishes the Theorem. 


Example 8.3.4. Find the eigenvalues \ and eigenvectors v of the matrix A = i ik 


Solution: We first find the eigenvalues as the solutions of the Eq. (8.3.1). Compute 


pte anh eb dap ais), 
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Then we compute its determinant, 


= Ay =A, 
o=aer(a— any = (05) a2 y)=e-v-9 4 ee 


We have obtained two eigenvalues, so now we introduce A, = 4 into Eq. (8.3.2), that is, 


ia s a 
Anar=|h ea a 


Then we solve for v* the equation 


(A-4I)v=0 © EB é| i ~ 3 


The solution can be found using Gauss elimination operations, as follows, 


ae ee Pe ee ee Vi = V2, 
5 23 f 23 0 O vi free. 


Al solutions to the equation above are then given by 


ee ea > em |" 
~ Tout} fa} 2 Hikes 


where we have chosen v; = 1. A similar calculation provides the eigenvector v” associated 


with the eigenvalue A. = —2, that is, first compute the matrix 
3.3 
seae le 8 


usneeo e & IP]-f 


The solution can be found using Gauss elimination operations, as follows, 
3 3] [1 1] fi 1 VY, = —%, 
3.3 3.3 0 0 vu, free. 
All solutions to the equation above are then given by 
v= ie a inte > v= = 
=lae = | ay al ie 


where we have chosen v; = 1. We therefore conclude that the eigenvalues and eigenvectors 
of the matrix A above are given by 


It is useful to introduce few more concepts, that are common in the literature. 


Definition 8.3.3. The characteristic polynomial of ann x n matrix A is the function 


p(d) = det(A— XJ). 
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Example 8.3.5. Find the characteristic polynomial of matrix A = EF 4 ; 


Solution: We need to compute the determinant 


p(a) = det(A— a2) = |) a2 [=O dP? -9= 8 A+ 1-9. 


We conclude that the characteristic polynomial is p(A) = \? — 2\ — 8. <J 


Since the matrix A in this example is 2 x 2, its characteristic polynomial has degree 
two. One can show that the characteristic polynomial of an n x n matrix has degree n. The 
eigenvalues of the matrix are the roots of the characteristic polynomial. Different matrices 
may have different types of roots, so we try to classify these roots in the following definition. 


Definition 8.3.4. Given annxn matrix A with real eigenvalues A;, wherei =1,--- ,k <n, 
it is always possible to express the characteristic polynomial of A as 


p(A) = (A = oe setts (A _ Nr)"*. 


The number r; is called the algebraic multiplicity of the eigenvalue ;. Furthermore, the 
geometric multiplicity of an eigenvalue ;, denoted as s;, is the maximum number of 
eigenvectors of A; that form a linearly independent set. 


Example 8.3.6. Find the eigenvalues algebraic and geometric multiplicities of the matrix 


ak 


Solution: In order to find the algebraic multiplicity of the eigenvalues we need first to find 
the eigenvalues. We now that the characteristic polynomial of this matrix is given by 


pON= (1 — A) 3 


i (1d) = (A-1)?-9. 


The roots of this polynomial are A, = 4 and A, = —2, so we know that p(A) can be rewritten 
in the following way, 
p(A) = (A—4)(A + 2). 
We conclude that the algebraic multiplicity of the eigenvalues are both one, that is, 
A, =4, m1=1, and Ag =—-2, rea=l1. 


In order to find the geometric multiplicities of matrix eigenvalues we need first to find the 
matrix eigenvectors. This part of the work was already done in the Example 8.3.4 above 


and the result is 
1 —1 
1Y=4, vw = Hl ; 2 =—-2, v= | | : 


From this expression we conclude that the geometric multiplicities for each eigenvalue are 
just one, that is, 
A,=4, s,=1, and Ag=-2, s,=1. 
dq 


The following example shows that two matrices can have the same eigenvalues, and so the 
same algebraic multiplicities, but different eigenvectors with different geometric multiplici- 
ties. 
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3.0 «#21 
Example 8.3.7. Find the eigenvalues and eigenvectors of the matrix A= |0 3 2]. 
0 0 1 


Solution: We start finding the eigenvalues, the roots of the characteristic polynomial 
(3 — ) 0 1 
pAl=| O (3 — A) 2 |=-(A-1)(A-3)? => { 
0 0 (1—A) 
We now compute the eigenvector associated with the eigenvalue A, = 1, which is the solution 
of the linear system 


A, =1, m=1, 
A= 3, 2 = 2. 


2:0 4 0 
(A-Dvw =0 & [0 2 2] Jy] = Jo 
0 0 0 0 


After the few Gauss elimination operation we obtain the following, 


(1) __ _ Us 
2 0 1 10 4 rr a 
0 2 2} 45 40 1 1 => oo) — yO) 
0 0 0 0 0 0 4 -* 
yf) free. 
Therefore, choosing yl) = 2 we obtain that 
—1 
vo) = |-2], 4 =1, r,=1, $,=1. 
2 


In a similar way we now compute the eigenvectors for the eigenvalue A, = 3, which are all 
solutions of the linear system 


0 0 0 
(A-31)v)=0 & 10 0 2] Iv] =I]0 
Gt 2 : | 0 


After the few Gauss elimination operation we obtain the following, 


(2) 

00 1 001 a 

00 2/>]0 0 of s 2) free, 

0 0 -2 0 0 0 (2) 

v3 =0. 
Therefore, we obtain two linearly independent solutions, the first one v) with the choice 
y?) = 1; y?) = 0, and the second one w® with the choice y?) = 0, y?) = 1, that is 
1 0 
v?) = 0] , w= }1], ss =3, —- S$, =2 


0 0 


Summarizing, the matrix in this example has three linearly independent eigenvectors. < 


Example 8.3.8. Find the eigenvalues and eigenvectors of the matrix A = 


oOW 


1 1 
3 2). 
0 1 
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Solution: Notice that this matrix has only the coefficient a 2 different from the previous 
example. Again, we start finding the eigenvalues, which are the roots of the characteristic 
polynomial 


Gs 4 1 


p(A)=| 0 (3-2) 2 |=-A-DOA-3? = { 
0 0 (1—) 


M=1, re =1, 
Ay = 38, T = 2. 
So this matrix has the same eigenvalues and algebraic multiplicities as the matrix in the 


previous example. We now compute the eigenvector associated with the eigenvalue », = 1, 
which is the solution of the linear system 


a a 0 
(A-Dw=0 & |0 2 2| Ju] = Jo 
0 0 0 0 


After the few Gauss elimination operation we obtain the following, 


oa, 


2 4 i 111 1 0 0 = 
02 2Sloa 1) loa 2) Ss ae aug! 
00 0 00 0 00 0 (1) 
v free. 
Therefore, choosing vs = 1 we obtain that 
wv) = |-1], A =1, ry =1, s,=1. 


1 


In a similar way we now compute the eigenvectors for the eigenvalue \, = 3. However, in 
this case we obtain only one solution, as this calculation shows, 


01 1 Ui 0 

(A-—31vW=0 & |0 0 2] [vy] = Jo 

0 0 -2 py 0 
3 

After the few Gauss elimination operation we obtain the following, 
(2) 

of 4 010 a 8 

00 2/3/00 1, = 7 =0, 
0 0 -2 0 0 0 (2) 

v3’ = 0. 


Therefore, we obtain only one linearly independent solution, which corresponds to the choice 
vy = 1, that is, 
1 
v?) = |0 ‘ Ao = 3, Ts = 2, 8, =1. 
0 
Summarizing, the matrix in this example has only two linearly independent eigenvectors, 
and in the case of the eigenvalue A, = 3 we have the strict inequality 


5 <= 2: 
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8.3.2. Diagonalizable Matrices. We first introduce the notion of a diagonal matrix. 
Later on we define a diagonalizable matrix as a matrix that can be transformed into a 
diagonal matrix by a simple transformation. 


Definition 8.3.5. Ann xn matriz A is called diagonal iff A = 


That is, a matrix is diagonal iff every nondiagonal coefficient vanishes. From now on we use 
the following notation for a diagonal matrix A: 


ail wae 0 
A => diag lau: ae ae = 
0 woe Onn 


This notation says that the matrix is diagonal and shows only the diagonal coefficients, 
since any other coefficient vanishes. The next result says that the eigenvalues of a diagonal 
matrix are the matrix diagonal elements, and it gives the corresponding eigenvectors. 


Theorem 8.3.6. If D = diag{di1,--- ,dnn|, then eigenpairs of D are 


iL 0 
ret, BOS | oy Seed eS |: 
0 
0 1 


Diagonal matrices are simple to manipulate since they share many properties with 
numbers. For example the product of two diagonal matrices is commutative. It is simple to 
compute power functions of a diagonal matrix. It is also simple to compute more involved 
functions of a diagonal matrix, like the exponential function. 


Example 8.3.9. For every positive integer n find A", where A = fi 4 ; 


Solution: We start computing A? as follows, 
2 _ [2 oO] [2 o] _ f2? 0 
A= AA=[) lk =lo | 


2 0] [2°06 2-0 
3 ydy - 
#=#4=|5 oll Sl=[0 9): 


Using induction, it is simple to see that A” = , a : <J 


We now compute A®, 


Many properties of diagonal matrices are shared by diagonalizable matrices. These are 
matrices that can be transformed into a diagonal matrix by a simple transformation. 


Definition 8.3.7. Annxn matriz A is called diagonalizable iff there exists an invertible 
matriz P and a diagonal matriz D such that 


A=PDP™. 
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Remarks: 

(a) Systems of linear differential equations are simple to solve in the case that the coefficient 
matrix is diagonalizable. One decouples the differential equations, solves the decoupled 
equations, and transforms the solutions back to the original unknowns. 

(b) Not every square matrix is diagonalizable. For example, matrix A below is diagonaliz- 


able while B is not, 
1 3 1/3 #1 
del ile B=5|4 al: 


Example 8.3.10. Show that matrix A = E | is diagonalizable, where 


1 -1 4 0 
pa[t 2] aw v= [9] 


Solution: That matrix P is invertible can be verified by computing its determinant, det(P) = 
1 —(-1) = 2. Since the determinant is nonzero, P is invertible. Using linear algebra meth- 
1 

ods one can find out that the inverse matrix is P~! = 7 fe : 


verify that PDP~! is indeed A. A straightforward calculation shows 


-_f1 -1)f4 o]1f1 1 
al al al Pew 


. Now we only need to 


I 


ee 
ao 
i) 
| 
[eR 
| 
are 
a 
a | 


lI 
eat i Sa See | 
eP bw bw 
bees 
an 
a | 


w 
Ke 


J 


There is a deep relation between the eigenpairs of a matrix and whether that matrix is 
diagonalizable. 


Theorem 8.3.8 (Diagonalizable Matrix). An n x n matrix A is diagonalizable iff A has 
a linearly independent set of n eigenvectors. Furthermore, if X;,v;, fori = 1,--- ,n, are 
eigenpairs of A, then 


A=PDP", P=|H,--+,%,), B=disg [A, °° stva|'s 


Proof of Theorem 8.3.8: 

(=) Since matrix A is diagonalizable, there exist an invertible matrix P and a diagonal 
matrix D such that 4 = PDP~!. Multiply this equation by P~! on the left and by P on 
the right, we get 

Da PAP. (8.3.3) 
Since n x n matrix D is diagonal, it has a linearly independent set of n eigenvectors, given 
by the column vectors of the identity matrix, that is, 


De = dye, D = diag [dii,--- ,dnn] , T= [e®,.-- el]. 
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So, the pair dj, e is an eigenvalue-eigenvector pair of D, for i = 1---,n. Using this 
information in Eq. (8.3.3) we get 

de =De& =P 7AP™O = A(Pe) = di (Pe), 


where the last equation comes from multiplying the former equation by P on the left. This 
last equation says that the vectors v = Pe” are eigenvectors of A with eigenvalue dij. 
By definition, v is the i-th column of matrix P, that is, 


P= [v®,--- vl]. 


Since matrix P is invertible, the eigenvectors set {v,--- , v™} is linearly independent. 
This establishes this part of the Theorem. 


(<) Let A;, v be eigenvalue-eigenvector pairs of matrix A, for i= 1,--- ,n. Now use 
the eigenvectors to construct matrix P = [v,--. , v™]. This matrix is invertible, since the 


eigenvector set {v“),--- , v(”)} is linearly independent. We now show that matrix P~!AP 
is diagonal. We start computing the product 


AP = A[v@),--- ul] = [Av®,---, Av] = Pv dna) - 


that is, 
P7'AP = Pi rv, ect Anv™] — [AP to), - a An Pty) ] . 


At this point it is useful to recall that P~! is the inverse of P, 
T=P 'P 3s [e),--- ,el")] =p! [vi ,.-- ul”) = [P-tol),--- , Py] . 
So, e = P~!y, fori =1--- ,n. Using these equations in the equation for P~! AP, 
PAP = [\ye,--» , Anel™] = diag [Ai,--+ An] - 
Denoting D = diag [A1, ee rv | we conclude that P~!AP = D, or equivalently 
A=PDP", P= [vl),.-- , of] , D = diag [A1,--+ , An] - 


This means that A is diagonalizable. This establishes the Theorem. 


Example 8.3.11. Show that matrix A = E | is diagonalizable. 
Solution: We know that the eigenvalue-eigenvector pairs are 


At = 4, U= H and r2 = —2, v= ral . 


Introduce P and D as follows, 
ef aps tL] off 3) 
We must show that A = PDP~!. This is indeed the case, since 
rom-fE 6 3 I) 
PDP = r E| ; E | = b Ee E Hl 


1 3 


We conclude, PDP! = E | => PDP-'!=A, that is, A is diagonalizable. <J 


With Theorem 8.3.8 we can show that a matrix is not diagonalizable. 
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1 
Example 8.3.12. Show that matrix B = 5 E | is not diagonalizable. 


Solution: We first compute the matrix eigenvalues. The characteristic polynomial is 


GS) 
5 5 3 5 1 
a) =|\2 2 = (S-a)(S-a) +5 = -4044 
p( ) i. 5 _ .) 2 2 4 7 
2 2 
The roots of the characteristic polynomial are computed in the usual way, 
1 


A= 5[4+ v16 — 16 > r=2, r=2. 


We have obtained a single eigenvalue with algebraic multiplicity r = 2. The associated 
eigenvectors are computed as the solutions to the equation (A — 2I)v = 0. Then, 


$y) 1 5 


2 2 a 
es (2-2) =! 
2 2 5} 


We conclude that the biggest linearly independent set of eigenvalues for the 2 x 2 matrix B 
contains only one vector, insted of two. Therefore, matrix B is not diagonalizable. <J 


Theorem 8.3.8 shows the importance of knowing whether an n x n matrix has a linearly 
independent set of n eigenvectors. However, more often than not, there is no simple way to 
check this property other than to compute all the matrix eigenvectors. But there is a simpler 
particular case, the case when an n x n matrix has n different eigenvalues. Then, we do not 
need to compute the eigenvectors. The following result says that such matrix always have 
a linearly independent set of n eigenvectors, hence, by Theorem 8.3.8, it is diagonalizable. 


Theorem 8.3.9 (Different Eigenvalues). If an n x n matrix has n different eigenvalues, 
then this matriz has a linearly independent set of n eigenvectors. 


Proof of Theorem 8.3.9: Let \1,:--,A, be the eigenvalues of an n x n matrix A, 
all different from each other. Let v“),---,u(™ the corresponding eigenvectors, that is, 
Av = d,v, with i = 1,--- ,n. We have to show that the set {v,--- , o(”)} is linearly 
independent. We assume that the opposite is true and we obtain a contradiction. Let us 
assume that the set above is linearly dependent, that is, there are constants ¢,,--- ,Cn, not 
all zero, such that, 


qu) eee ee Cn v™ = 0. (8.3.4) 


Let us name the eigenvalues and eigenvectors such that c, 4 0. Now, multiply the equation 
above by the matrix A, the result is, 


CAV) +--+ + epAnv™ = 0. 
Multiply Eq. (8.3.4) by the eigenvalue \,,, the result is, 
CAnVY +--+ cpdnv™ = 0. 


Subtract the second from the first of the equations above, then the last term on the right- 
hand sides cancels out, and we obtain, 


Ci hy — Nee has Goa 4 Og dy oO =o. (8.3.5) 
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Repeat the whole procedure starting with Eq. (8.3.5), that is, multiply this later equation 
by matrix A and also by A,_1, then subtract the second from the first, the result is, 


c1(A1 =, An) (At — An—1)0) ap ease Cn—2(An—2 — An) (An—2 —* wae = 0. 
Repeat the whole procedure a total of n — 1 times, in the last step we obtain the equation 
e1(Ar — An)Oa — Ani) +++ Ar — As)On — Ag)o = 0. 


Since all the eigenvalues are different, we conclude that c, = 0, however this contradicts our 
assumption that c, 4 0. Therefore, the set of n eigenvectors must be linearly independent. 
This establishes the Theorem. 


Example 8.3.13. Is matrix A = ki 


1) 3. j 
| diagonalizable? 
Solution: We compute the matrix eigenvalues, starting with the characteristic polynomial, 


=a) 7 
BA dad) 


The roots of the characteristic polynomial are the matrix eigenvalues, 
Mt = 0, 2 = 2. 


The eigenvalues are different, so by Theorem 8.3.9, matrix A is diagonalizable. <J 


=(1—A)?-1=)\?-2 = pid) =A(A- 2). 
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8.3.3. Exercises. 


8.3.1.- . 8.3.2.- . 


376 8. REVIEW OF LINEAR ALGEBRA 


8.4. The Matrix Exponential 


When we multiply two square matrices the result is another square matrix. This property 
allow us to define power functions and polynomials of a square matrix. In this section we 
go one step further and define the exponential of a square matrix. We will show that the 
derivative of the exponential function on matrices, as the one defined on real numbers, is 
proportional to itself. 


8.4.1. The Exponential Function. The exponential function defined on real num- 
bers, f(z) = e%”, where a is a constant and x € R, satisfies f’(z) = a f(x). We want to 
find a function of a square matrix with a similar property. Since the exponential on real 
numbers can be defined in several equivalent ways, we start with a short review of three of 
ways to define the exponential e”. 


(a) The exponential function can be defined as a generalization of the power function from 
the positive integers to the real numbers. One starts with positive integers n, defining 


e" =e---e, n-times. 
Then one defines e° = 1, and for negative integers —n 
1 
en’ 


m 
The next step is to define the exponential for rational numbers, —, with m,n integers, 
n 


en = Vem. 


The difficult part in this definition of the exponential is the generalization to irrational 
numbers, x, which is done by a limit, 


It is nontrivial to define that limit precisely, which is why many calculus textbooks do 
not show it. Anyway, it is not clear how to generalize this definition from real numbers 
x to square matrices X. 


(b) The exponential function can be defined as the inverse of the natural logarithm function 
g(x) = In(x), which in turns is defined as the area under the graph of the function 


a 
h(a) = — from 1 to a, that is, 
av 


“74. 
n(x) = fo dy, © € (0,00). 


Again, it is not clear how to extend to matrices this definition of the exponential function 
on real numbers. 


(c) The exponential function can be defined also by its Taylor series expansion, 


CO Lk 
x __ BS ae | Fg te ec eis ae 
CPD ag PaO ee a 
k=0 


Most calculus textbooks show this series expansion, a Taylor expansion, as a result from 
the exponential definition, not as a definition itself. But one can define the exponential 
using this series and prove that the function so defined satisfies the properties in (a) 
and (b). It turns out, this series expression can be generalized square matrices. 
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We now use the idea in (c) to define the exponential function on square matrices. We 
start with the power function of a square matrix, f(X) = X" = X---X, n-times, for X a 
square matrix and n a positive integer. Then we define a polynomial of a square matrix, 


p(X) = anX” + an 1X1 +--+ aol. 
Now we are ready to define the exponential of a square matrix. 


Definition 8.4.1. The exponential of a square matrix A is the infinite sum 


e4 = = = (8.4.1) 


This definition makes sense, because the infinite sum in Eq. (8.4.1) converges. 


Theorem 8.4.2. The infinite sum in Eq. (8.4.1) converges for all n x n matrices. 


Proof: See Section 2.1.2 and 4.5 in Hassani [6] for a proof using the Spectral Theorem. 


The infinite sum in the definition of the exponential of a matrix is in general difficult 
to compute. However, when the matrix is diagonal, the exponential is remarkably simple. 


Theorem 8.4.3 (Exponential of Diagonal Matrices). If D = diag (di, vee Hal , then 


eP? = diag [e%,--- ,e4>] . 


Proof of Theorem 8.4.3: We start from the definition of the exponential, 


P =F (diag [as,-++ sdn])* = S> Feiag [ld --+ (dn). 
k=0 k=0 


where in the second equallity we used that the matrix D is diagonal. Then, 


oS _ d,)* aa” c oS d,)* _ Gaye 
P= Y aiog| Sy a ] =diag[S> a yo | 


k=0 k=0 
o° k 
Each sum in the diagonal of matrix above satisfies S- he e*‘. Therefore, we arrive to 
k=0.."* 
the equation e? = diag le", ee et]. This establishes the Theorem. 


0 7 


Solution: We follow the proof of Theorem 8.4.3 to get this result. We start with the 
definition of the exponential 


Example 8.4.1. Compute e“, where A = F "| : 


Aad deal lb 7 


Since the matrix A is diagonal, we have that 
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Therefore, 
sale elle gl-Po* oss] 
A _ os nl a n=0 n! 
e = rs n —_— nm — [e-e) nm . 
» n! F 7 d 0 + 0 er SG 
co Te E 7 e? 0 
Since )77—o0 Sr = e*, for a = 2, 7, we obtain that e = i al: <J 


8.4.2. Diagonalizable Matrices Formula. The exponential of a diagonalizable ma- 
trix is simple to compute, although not as simple as for diagonal matrices. The infinite sum 
in the exponential of a diagonalizable matrix reduces to a product of three matrices. We 
start with the following result, the nth-power of a diagonalizable matrix. 


Theorem 8.4.4 (Powers of Diagonalizable Matrices). If ann xn matrix A is diagonalizable, 
with invertible matrix P and diagonal matrix D satisfying A = PDP", then for every 
integer k > 1 holds 

Pa PiPr. (8.4.2) 
Proof of Theorem 8.4.4: For any diagonal matrix D = diag (di, vee «an we know that 

D* =diag |d?s++, de] 

We use this result and induction in n to prove Eq.(8.4.2). Since the case n = 1 is trivially 
true, we start computing the case n = 2. We get 


A? = (PDP-')’ = (PDP~')(PDP“') = PDDP"! = A? = PDP, 


that is, Eq. (8.4.2) holds for k = 2. Now assume that Eq. (8.4.2) is true for k. This equation 
also holds for k + 1, since 


Aes PA= (PD) PoP a Por por Sa Pp DP. 
We conclude that A(t!) = PD‘*+1) P-!. This establishes the Theorem. 


We are ready to compute the exponential of a diagonalizable matrix. 


Theorem 8.4.5 (Exponential of Diagonalizable Matrices). If ann x n matriz A is diago- 
nalizable, with invertible matrix P and diagonal matrix D satisfying A= PDP~', then the 
exponential of matrix A is given by 


e4 = Pe? P~}, (8.4.3) 


Remark: Theorem 8.4.5 says that the infinite sum in the definition of e4 reduces to a 
product of three matrices when the matrix A is diagonalizable. This Theorem also says that 
to compute the exponential of a diagonalizable matrix we need to compute the eigenvalues 
and eigenvectors of that matrix. 


Proof of Theorem 8.4.5: We start with the definition of the exponential, 
1 1 1 
A _ n _ —1\n _ np-l 
e =) W4 =o (PPP ) => (PP Pp. 
k=0 k=0 k=0 
where the last step comes from Theorem 8.4.4. Now, in the expression on the far right we 
can take common factor P on the left and P~! on the right, that is, 


oO 4 

A_ n —1 

€ 20 — D") P ; 
=0 
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The sum in between parenthesis is the exponential of the diagonal matrix D, which we 
computed in Theorem 8.4.3, 
e4 = Pe? P~}, 


This establishes the Theorem. 


We have defined the exponential function F(A) = e4 : R"*" — R"*", which is a 
function from the space of square matrices into the space of square matrices. However, 
when one studies solutions to linear systems of differential equations, one needs a slightly 
different type of functions. One needs functions of the form F(t) = e4* : R 4 R"*", 
where A is a constant square matrix and the independent variable is t € R. That is, one 
needs to generalize the real constant a in the function f(t) = e*’ to an n x n matrix A. 
In the case that the matrix A is diagonalizable, with A = PDP™!, so is matrix At, and 
At = P(Dt)P~'. Therefore, the formula for the exponential of At is simply 


eft — PePtp-1, 


We use this formula in the following example. 


Example 8.4.2. Compute e“', where A = EF andt eR. 


Solution: To compute e4* we need the decomposition A = PDP~!, which in turns im- 
plies that At = P(Dt)P~'. Matrices P and D are constructed with the eigenvectors and 
eigenvalues of matrix A. We computed them in Example ??, 


At = A, y= Hl and dQ = —2, Vv2= | . 


Introduce P and D as follows, 


J a 4 iT 4 [4 0 
p=|; | aad = ils p= |) ak 


Then, the exponential function is given by 


point |) —i)fe* Oo) [1 4 
oo =|; ells e7| 3 |-1 1° 


Usually one leaves the function in this form. If we multiply the three matrices out we get 


ea) 


— 9 (ett _ a) (et 4 eo) 


<J 


8.4.3. Properties of the Exponential. We summarize some simple properties of the 
exponential function in the following result. We leave the proof as an exercise. 


Theorem 8.4.6 (Algebraic Properties). If A is ann x n matrix, then 
(a) If 0 is then x n zero matriz, then e® = I. 
(b) (e4)* - elAT) where T means transpose. 


(c) For all nonnegative integers k holds A® e4 = e4 A*. 
(d) If AB = BA, then Ae? =e? A and ec? =F eA. 


An important property of the exponential on real numbers is not true for the exponential 
on matrices. We know that e% e? = e+? for all real numbers a, b. However, there exist nx n 
matrices A, B such that e4 e? 4 e4+®. We now prove a weaker property. 
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Theorem 8.4.7 (Group Property). If A is ann xn matrix and s, t are real numbers, then 


e4s eft = eAlstt) | 


Proof of Theorem 8.4.7: We start with the definition of the exponential function 


As cAt — (o A > (3 ") = wy ee a 


j! 


We now introduce the new label n = 7 +k, then 7 = n — k, and we reorder the terms, 


A” gr- k tk Ses A” im ] os 
“>> Gp 2a a (ewe ‘t). 


n=0 k= 


! 
If we recall the binomial theorem, (s +t)” = >» joe s”—* +k we get 
n—k)!k! 


A 
As (At n A(s+t) 
= —_ i) = ‘ 
ee y (s +t) € 


This establishes the Theorem. 


If we set s = 1 and t = —1 in the Theorem 8.4.7 we get that 


feta At) ome 7 


so we have a formula for the inverse of the exponential. 


Theorem 8.4.8 (Inverse Exponential). If A is ann x n matrix, then 


(4) =e“, 


Example 8.4.3. Verify Theorem 8.4.8 for e4*, where A = F : 


] and te. 


Solution: In Example 8.4.2 we found that 


ait ie +e) (ett — “) 


9 (e* -_ a) (e*# +e 7) 


A 2 x 2 matrix is invertible iff its determinant is nonzero. In our case, 


1 1 1 1 
det (c4") = 5 (e+e 2) (e+e 2t) 5 (ef F 2) = ( At 2t) ot. 
hence e“* is invertible. The inverse is 
Coa | (eM re") (el + S| 
e2t 2 |(—e* +e-%) (e## +e7%) |’ 
that is 


(Ay! _ifé +e) (—ef + et) z= (e~4t 4 24) (e-4t — 62) sgt 
2 |(-e7#+e-*) (e% + e~*) 2 


We now want to compute the derivative of the function F(t) = e4*, where A is a 


constant n x n matrix and t € R. It is not difficult to show the following result. 
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Theorem 8.4.9 (Derivative of the Exponential). If A is ann x n matriz, andt € R, then 


d At At 
—e" = Ae”, 
46° e 


Remark: Recall that Theorem 8.4.6 says that Ae4 = e4 A, so we have that 


a at =Aett=eAA. 
dt 


First Proof of Theorem 8.4.9: We use the definition of the exponential, 


diy d SAMM™ GA dad, Garr ae ae a 
dt ae n! =), n! at I 2a gi (n—1)! ’ 


n=0 n= n= 


therefore we get 


ce = Ae", 


This establishes the Theorem. 


Second Proof of Theorem 8.4.9: We use the definition of derivative and Theorem 8.4.7, 


A(t+h) _ ,At At ,Ah _ ,At Ah _ J 
F’(t) = Jim ————*— = Jim "= e* (lim © } 
h—-0 h—0 h h—0 
and using now the power series definition of the exponential we get 
1 A*h? 
if _ ,At |; = ee — At 
F'(t) =e [Jim | (An+ I + )|=e A. 
This establishes the Theorem. 
Example 8.4.4. Verify Theorem 8.4.9 for F(t) = e4*, where A = i | and t € R. 
Solution: In Example 8.4.2 we found that 
oAt 7 it (ett + e-2#) (ett _ e~t) 
~ 9 |(e4# — e724) (ett + e724) 
Therefore, if we derivate component by component we get 
d Ae. 1 (4e% — 2e-*) (4e% + 26-2) 
di 2 |(4e4*# + 2e7*) (det —2e-2)|° 
On the other hand, if we compute 
Act z 1 3 il (eo te es) (e*# _ eo") 
3 112 (e*# — et) (e*# + e~*4) 
_ 1[(4e—2e-%)  (4e% + 2e7%) 
—_ 9 (4e% + 2e72) (4e% — 2e-?*) 
dat At _ a at Ak Ava : bo 
Therefore, oS Ae. The relation = A is shown in a similar way. <J 
We end this review of the matrix exponential showing when formula e4+? = e4 e? 


holds. 
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Theorem 8.4.10 (Exponent Rule). If A, B aren x n matrices such that AB = BA, then 


gts = eA e”. 


Proof of Theorem 8.4.10: Introduce the function 
F(t) = e(AtB)t e— Bt pat 
where t € R. Compute the derivative of F, 
F'(t) = (A+ B) Att e-Bt eAt 4 @(AtB)t (_ B) e~Bt At 4 e(A+B)t @— Bt (_ A) e~ At, 
Since AB = BA, we know that e~?7 A = Ae~®*, so we get 
F'(t) = (A+ B) e(A+B)t ¢—Bt e-At _ ((AtB)t Be —Bt e—At _ G(AFB)t 4 e—Bt e— At, 

Now AB = BA also implies that (A+ B) B = B(A+ B), therefore Theorem 8.4.6 implies 
eAtTB)t B= BelAtTBIt, 

Analogously, we have that (A+ B) A= A(A+ B), therefore Theorem 8.4.6 implies that 
e(AtB)t 4 = AclAtB)t. 

Using these equation in F” we get 

F(t) =(A+B)F(t)-BF(t)-AF(t) => F'(t)=0. 
Therefore, F(t) is a constant matrix, F(t) = F(0) = J. So we get 


a(AtB)t -Bte-At7 as plate) — pAb p Bt 


This establishes the Theorem. 
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8.4.4. Exercises. 


8.4.1.- Use the definition of the matrix ex- 
ponential to prove Theorem 8.4.6. Do 
not use any other theorems in this Sec- 
tion. 


8.4.2.- Compute e“ for the matrices 


8.4.3.- Show that, if A is diagonalizable, 
det e*) =e"). 


Remark: This result is true for all 
square matrices, but it is hard to prove 
for nondiagonalizable matrices. 


8.4.4.- A square matrix P is a projection iff 
aa 
Compute the exponential of a projec- 


tion matrix, e?. Your answer must not 
contain any infinite sum. 


8.4.5.- If A? = J, show that 


8.4.6.- If \ and v are an eigenvalue and 
eigenvector of A, then show that 


A 
e vu=e wv. 


8.4.7.- Compute e“ for the matrices 


(@) 4=|) i 


8.4.8.- * Compute the function F(t) = e“* 
for any real number t and 


4 -1 
A= E | | 
8.4.9.- Compute the function F(t) = e*¢ 
for any real number t and 


—7 2 
aa ee | , 
8.4.10.- By direct computation show that 
eAtB) & 04 @F for 


4=[o of =[0 ol 


CHAPTER 9 


Appendices 


A. Overview of Complex Numbers 


The first public appearance of complex numbers was in 1545 Gerolamo Cardano’s Ars 
Magna, when he published a way to find solutions of a cubic equation ax? + br +c = 0. The 
solution formula was not his own but given to him sometime earlier by Scipione del Ferro. 
In order to get such formula there was a step in the calculation involving a /—1, which was 
a mystery for the mathematicians of that time. There is no real number so that its square 
is —1, so what the heck does this symbol /—1 mean? More intriguing, a few steps later 
during the calculation, this /—1 cancels out, and it does not appear in the final formula for 
the roots of the cubic equation. It was like a ghost entered your calculation and walked out 
of it without leaving a trace. Maybe we should call them magic numbers. 

Everything in nature is magic until we understand how it works, then knowledge ad- 
vances and magic retreats, one step at a time. It took a while, until the beginning of the 
19th century with the—independent but almost simultaneous—works of Karl Gauss and 
William Hamilton, but our magic numbers were finally understood and they became the 
complex numbers. 

In spite of their name, there is nothing complex about complex numbers. Planar num- 
bers is a better fit to what they are—the set of all ordered pairs of real numbers together 
with specific addition and multiplication rules. Complex numbers can be identified with 
points on a plane, in the same way that real numbers can be identified with points on a line. 


Definition A.1. Complex numbers are numbers of the form 


(a,b), 
where a and b are real numbers, together with the operations of addition, 
(a,b) + (c,d) = (a+c,b+d), (A.1) 
and multiplication, 
(a, b)(c, d) = (ac — bd, ad + be). (A.2) 


The operation of addition is simple to understand because it is exactly how we add 
vectors on a plane, 


(a,b) + (c,d) = ((a +e), (b+d)). 
It is the multiplication what distinguishes complex numbers from vectors on the plane. To 
understand these operations it is useful to start with the following properties. 


Theorem A.2. The addition and multiplication of complex number are commutative, as- 
sociative, and distributive. That is, given arbitrary complex numbers x, y, and z holds 

(a) Commutativity: e+ y=yta andxy=yz. 

(b) Associativity: x + (y+z) =(a+y)+2 and x (yz) = (xy)z. 

(c) Distributivity: «(y+ z) =avytarz. 
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Proof of Theorem A.2: We show how to prove one of these properties, the proof for the 
rest is similar. Let’s see the commutativity of multiplication. Given the complex numbers 
x = (a,b) and y = (c,d) we have 

xy = (a,b)(c,d) = ((ac — bd), (ad + be)) 
and 

yx = (c,d)(a,b) = ((ca — db), (cb + da)) 
therefore we get that xy = yx. The rest of the properties can be proven in a similar way. 
This establishes the Theorem. 


We now mention a few more properties of complex numbers which are straightforward 
from the definitions above. For all complex numbers (a,b) we have that 
(0,0) + (a, 6) = (a, 6) 
(—a, =b) + (a, b) = (0, 0) 
(a, b)(1,0) = (a, b). 
From the first equation above the complex number (0, 0) is called the zero complex number. 
From the second equation above the complex number (—a, —)) is called the negative of (a, b), 
and we write 
—(a,b) = (—a,—b). 
From the last equation above the complex number (1,0) is called the identity for the mul- 
tiplication. 
The inverse of a complex number (a,b), denoted as (a,b)~!, is the complex number 
satisfying 
(a,b) (a,b)~* = (1,0). 
Since the inverse of a complex number is itself a complex number, it can be written as 


(a, ve = (c, d) 


for appropriate components c and d. The next result gives us a formula for these components. 
The next result says that every nonzero complex number has an inverse. 


Theorem A.3. The inverse of (a,b), with eithera £0 orb #40, is 


(a,5)* = (a : By a): (A.3) 


Proof of Theorem A.3: A complex number (a,b)~! is the inverse of (a, b) iff 
(a,b) (a,b)~* = (1,0). 
When we write (a, b)~' = (c,d), the equation above is 
(a, b)(c,d) = (1,0). 
If we compute explicitly the left-hand side above we get 
((ac — bd), (ad + be)) = (1,0). 
The equation above implies two equations for real numbers, 
ac — bd = 1, ad+ be = 0. 
In the case that either a 4 0 or b £0, the solution to the equations above is 
a —b 


© +B)’ (a? +) 
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Therefore, the inverse of (a, b) is 


—b 
j= a 
ab) (cama aR) 
This establishes the Theorem. 


Example A.1. Find the inverse of (2,3). Then, verify your result. 
Solution: The formula above says that (2,3)~! is given by 


03" = (olay asm) = @ar'=(5.5). 


This is correct, since 


dq 


A.1. Extending the Real Numbers. The set of all complex numbers of the form 
(a,0) satisfy the same properties as the set of all real numbers a. Indeed, for all a, c reals 
holds 

(a, 0) + (c,0) = (a+c,0), (a,0)(c, 0) = (ac, 0). 
We also have that 


—(a, 0) = (—a, 0), 
and the formula above for the inverse of a complex number says that 
1 
(a,0)-! = (-.0). 
a 


From here it is natural to identify a complex number (a,0) with the real number a, that is, 
(a,0) <> a. 


This identification suggests the following definition. 


Definition A.4. The real part of z = (a,b) is a and—then it is natural to call—the 
imaginary part of z is b. We also use the notation 


a = Re(z), b = Im(z). 


A.2. The Imaginary Unit. We understood complex numbers of the form (a, 0). They 
are no more than the real numbers. Now we study complex numbers of the form (0, b)— 
complex numbers with no real part. In particular, we focus on the complex number (0,1), 
which we call the imaginary unit. Let us compute its square, 


Within the complex numbers we do have a number whose square is negative one, and that 


number is the imaginary unit (0,1). Actually, there are two complex numbers whose square 
is negative one, one is (0,1) and the other is —(0,1), because 


(0, —1)? = (0, —1)(0, —1) = (0 — (—1)(—1),0 +0) = (—1,0) = —(1,0). 
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So, in the set of complex numbers we do have solutions for the \/—(1,0), given by 


J-0,D =40,1), 


Notice that /—1 has no solutions, but \/—(1,0) has two solutions. This is the origin of the 
confusion with del Ferro’s calculation. Most of his calculation used numbers of the form 
(a, 0)—written as a— except at one tiny spot where a number (0,1) shows up and later on 
cancels out. Del Ferro’s calculation makes perfect sense in the complex realm, and almost 
all of it can be reproduced with real numbers, but not all. 


A.3. Standard Notation. We can now relate the ordered pair notation we have been 
using for complex numbers with the notation used by the early mathematicians. We start 
noticing that 


(a, b) = (a,0) + (0, b) = (a,0) + (b,0)(0, 1). 


Therefore, if we write a for (a,0), b for (b,0), and we use i = (0,1), we get that every 
complex number (a,b) can be written as 


(a,b) =a-+ bi. 
Recall, a and b are the real and imaginary parts of (a,b). And the equation 
(0, iby = =, 0) 


in the new notation is 
?=-l. 


This notation (a+) is useful to manipulate formulas involving addition and multiplication. 
If we multiply (a+ bi) by (c+ di) and use the distributive and associative properties we get 


(a + bi)(c + di) = ac + adi + cbi + bdi?, 
and if we recall that i? = —1 and we reorder terms, we get 
(a + bi)(c + di) = ac — bd + (ad + bc)i. 


So, we do not need to remember the formula for the product of two complex numbers. 
With the new notation, this formula comes from the distributive and associative properties. 
Similarly, to compute the inverse of a complex number a + bi we may write 


1. 1 -@-) 

a+bi (a+bi) (a— bi) 
_ (a — bi) 

(a + bi)(a — bi) 


Notice that 
(a+ bi)(a — bi) =a? +B, 
which has only a real part. Then we can write 


1 a— bi = 1 a b 
— = t 
a+bi (a? +6?) a+bi (a?+06?) (a? +0?) 


which agrees with the formula we got in Theorem A.3. 
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A.4. Useful Formulas. The powers of 7 can have only four possible results. 


Theorem A.5. The integer powers of i can have only four results: 1, i, —1, and —1. 


Proof of Theorem A.5: We just show that this is the case for the first powers. Dy 
definition of a power zero and power one we know that 


We also know that 


P=fis=(-li=-i 
f=81=-4=-7=1 
P=fi=(li=i 
PSPia=iS—1 
= i¢=(-l1lhi=-i 


An argument using induction would proof this Theorem. 


The conjugate of a complex number a + bi is the complex number 
a+bi=a-— bi. 


For example, 
14+ 2i=1-2i, a=a, i= —4, 4i = —4i. 


If we conjugate twice we get the original complex number, that is a + bt = a+ bi. 
The modulus or absolute value of a complex number a+ bi is the real number 


la + bil = V/a? +B. 
For example 
|I3+4)=V9+16=V25=5, |at+OiJ=lel, fi|=1, [1+i=v2. 
Using these definitions is simple to see that 
(a + bi) (a + bi) = (a + bi)(a — bi) = (a? +B?) = |a 4 dil”. 


Using these definitions we can rewrite the formula in Eq. (A.3) for the inverse of a complex 
number as follows, 
1 1 


on a 


If we call z = a+ bi, then the formula for z~! reduces to 


bi). 
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1 
Example A.2. Write B+) in the form c+ di. 


Solution: You multiply numerator and denominator by 3 — 42, 


i .. 4 @=%) 
(34+44) (3+ 42) (3 — 44) 
_ 3-4) 
(32 + 4?) 
_ 34s 
~ 35 
3 4: 
=—-— i. 
25 25 
So, we have found that the i £ (3 +49) is (2 =i) < 
Oo, we Nave foun a e inverse oO 1) 18 5 25° 2 


The absolute value of complex numbers satisfy the triangle inequality. 
Theorem A.6. For all complex numbers 21, 22 holds |z, + 22| < |z:| + |Z. 
Remark: The idea of the Proof of Theorem A.6 is to use the graphical representations of 
complex numbers as vectors on a plane. Then |z,| is the length of the vector given by 2;, 


and the same holds for the vectors associated to z, and z,+ 22, the latter being the diagonal 
in the parallelogram formed by z, and z,. Then it is clear that the triangle inequality holds. 


The absolute value of a complex number also satisfies the following properties. 
Theorem A.7. For all complex numbers 21, 22 holds |z,22| = |2:| |22|. 


Proof of Theorem A.7: For an arbitrary complex numbers z, = a+ bi and z, = c+ di, 
we have 


2122 = (ac — bd) + (ad + be)i, 
therefore, 


2122]? = 


(ac — bd)? + (ad + be)? 

= (ac)” + (bd)? — 2acbd + (ad)? + (bc)? + 2adbce 
= a?c? + b?d? 4+ a2? + bc? 

= a2(e a d?) +020 +c?) 

= (a2 +0\( +c?) 


= FA esis 


Taking a square root we get 


|2122| = |21] |22l- 


This establishes the Theorem. 


Theorem A.8. Every compler number z satisfies that |z”| =|z|", for all integer n. 
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Proof of Theorem A.8: One proof uses the previous theorem A.7 and induction in n. 
For n = 2 it is proven by the theorem above, 


|27| = lez] = lz lel = lI”. 
Now, suppose the theorem is true for n — 1, so |z"~"| = |z|"~!. Then 
J2"| = Jz" 2] = [2"™ | [aI 


where we used the previous theorem A.7. But in the first factor we use the inductive 
hypothesis, 
lr" | 


Jer" lal = |e? Lal = lel”. 


So we have proven that |z"| = |z|". This establishes the Theorem. 


Remark: A second proof, independent of the previous theorem is that, for an arbitrary 
non-negative integer n we have, 


lz" |= Vn = of 2% (z)” = /(zz)" = (Vzz)” = |z|” 


Example A.3. Verify the result in Theorem A.8 for n = 3 and z = 34 4i. 
Solution: First we compute |z| and then its cube, 
jz] =|8+ 4) =V94+16=5 => |z|? = 125. 
We now compute z?, and then its absolute value, 
2 = (3+4i)(34+44)(34+ 44) =—-117444i = [24] = 1172 4 44 = 125. 


Therefore, |z|? = |z°|. As an extra bonus, we found another perfect triple, besides the 
famous 32 + 4? = 5?, which is 
44? +117? = 1257. 


A.5. Complex Functions. We know how to add and multiply complex numbers 


(a+ bi) + (c+ di) =(a+c)+(b+d)i, 
(a + bi)(c + di) = (ac — bd) + (ad + be)i. 


This means we know how to extend any real-valued function defined on real numbers having 
a Taylor series expansion. We use the function Taylor series as the definition of the function 
for complex numbers. For example, the real-valued exponential function has the Taylor 
series expansion 


n=0 


Therefore, we define the complex-valued exponential as follows. 


Definition A.9. The complex-valued exponential function is given by 


e=>" > (A.4) 
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Remark: We are particularly interested in the case that the argument of the exponential 
function is of the form z = (a + bi)t, where rz = a + bi are the roots of the characteristic 
polynomial of a second order linear differential equation with constant coefficients. In this 
case, the exponential function has the form 


: = eb bee 
elatbilt = S- ( m ; 
n=0 
The infinite sum on the right-hand side in equation (A.4) makes sense, since we know 
how to multiply—hence compute powers—of complex numbers, and we know how to add 
complex numbers. Furthermore, one can prove that the infinite series above converges, 
because the series converges in absolute value, which implies that the series itself converges. 
Also important, the name we chose for the function above, the exponential, is well chosen, 
because this function satisfies the exponential property. 


Theorem A.10 (Exp. Property). For all complex numbers z,, 22 holds e#1*72 = e*! e”2. 


Proof of Theorem A.10: A straightforward calculation using the binomial formula implies 


exitee = > (21 + Za) 


n! 


n=0 
k n! 
n=0 k=0 
Sy oe 
lin —k)V 
= k(n — k) 
! 
where we used the notation 6 =F = il This double sum is over the triangular 
te 


region in the nk space given by 
0<n<w O0<k<n. 
We now interchange the order of the sums, the indices be given by 


0<k<w kin<ow, 


so we get 
Sea EF ae 
k(n — k)! k(n — k)! 
n=0 k=0 k=0 n=k 


OO © 4k yn—k 8 09 Lk gin 
dd, lin 7 kyl dd, im 
ok cS ym 
=o) ap 
— ot eo 7 


So we have shown that e71+?2 = e*1 e72. This Establishes the Theorem. 


The exponential property in the case that the exponent is z = (a + bi)t has the form 


elatbi)t = eat eibt 
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The first factor on the right-hand side above is a real exponential, which—for a given value 
of a 4 0—it is either a decreasing (a < 0) or increasing (a > 0) function of t. The second 
factor above is an exponential of a pure imaginary exponent. These exponentials can be 
summed in a closed form. 


Theorem A.11 (Euler Formula). For any real number @ holds that e’® = cos(0) + isin(@). 


Proof of Theorem A.11: Recall that i” can have only four results, 1, i, —1, —7. This result 
can be summarized as 

Pee (-1e = Oe = Ei): 
If we split the sum in the definition of the exponential into even and odd terms in the sum 
index, we get 


a oo irer oo j2ng2n oo j2n+lg2n+l 
: = 2, nl = 23 (2n)! > (Qn+1)!’ 
and using the property above on the powers of 7 we get 
co j2rg2n co pentlg2ntl co a | ng2n co —1 ng2n+1 
ee en ee ee 
a (2n)! (2n + 1)! = (2n)! mr (2n + 1)! 


Recall that Taylor series expansions of the sine and cosine functions 


; _ co (a1)ep?e) _ ie (—1)"9?" 
sin(@) = dX “Qnty” cos(0) = » Pnyr 


Therefore, we have shown that 


e’® = cos(0) + isin(6). 


This establishes the Theorem. 


A.6. Complex Vectors. We can extend the notion of vectors with real components 
to vectors with complex components. For example, complex-valued vectors on a plane are 
vectors of the form 


v= (at bi,c+di), 
where a, 0, c, d are real numbers. We can add two complex-valued vectors component-wise. 
So, given 
v, = (a, + bi, cg, + dt), Vy. = (Az + bet, Co + dat), 

we have that 

V1 + V2 = ((a1 + Ge) + (b1 + b2)t, (C1 + C2) + (di + de)i). 
For example 

(2 + 31,4 + 5i) + (6+ 72,84 91) = (8 + 102, 12 + 142). 
We can also multiply a complex-valued vector by a scalar, which now is a complex number. 
So, given v= (a+ bi,c+ dt) and z = z, + 22%, then 

ZU = (z + %t)(a + bi,c + di) = ((z1 + 22t)(a + bi), (21 + zi)(c + di). 
For example 
i (2+ 37,44 5i) = (27 — 3, 4i — 5) 
= (-3 + 21,-5 + 4t). 
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The only non-intuitive calculation with complex-valued vectors is how to find the length 
of a complex vector. Recall that in the case of a real-valued vector v = (a,b), the length of 


the vector is defined as 
||| = /v- v= Va? +d?, 


where - is the dot product of vectors, that is, given the real-valued vectors v, = (a1, 01), 
V_ = (do, b2), their dot product is the real number 


Vz + Up = 04 4_ + Dy bo. 


We want to generalize the notion of length from real-valued vectors to complex-valued 
vectors. Notice that the length of a vector—real or complex—must be a real number. Un- 
fortunately, in the case of a complex-valued vector v = (a+ bi,c+ di) the formula /v- v 
is not always a real number, it may have a nonzero imaginary part. In order to get a real 
number for the length of a complex-valued vector we define 


Il ull = vo- v, 
where the conjugate of a vector means to conjugate all its components, that is 
0 = (a+ bi,c+ di) = (a — bi,c — di). 


We needed to introduce the conjugate in the first vector in the formula above so that the 
result is a real number. Indeed, we have the following result. 


Theorem A.12. The length of a complex-valued vector v = (a + bi,c+ di) is 


|u| =V0-0= Ve+P+24a. 


Proof of Theorem A.12: This is a straightforward calculation, 
v2 =o: v 
= (a— bi,c— di) - (a+ bi,c + di) 
= (a — bi)(a + bt) + (ce — dt) (c+ di) 
=074+P?+7 +d’. 


So we get the formula 


Il vl] = Va? + 0? + 2 + a2. 


This establishes the Theorem. 


Example A.4. Find the length of v = (1 + 2i,3 + 4%) 
Solution: The length of this vector is 
|v] = 12 + 22 + 32 + 42 = V30. 


<q 


A unit vector is a vector with length one, that is, u is a unit vector iff ||u|| = 1. 
Sometimes one needs to find a unit vector parallel to some vector v. For both real-valued 
and complex-valued vectors we have the same formula. A unit vector u parallel to v £ 0 is 

1 


uU= — Vv. 
I|v|| 
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Example A.5. Find a unit vector in the direction of v = (3 + 2i, 1 — 2%). 
Solution: First we check that v is not a unit vector. Indeed, 
lly? = 0+ v 
= (3 — 21,1 + 22) - (3 + 2%, 1 — 22) 
= (3 — 27)(3 + 27) + (1 + 22)(1 — 22) 
= 37427417 +2? 
= 14. 


Since ||v|| = V14, the vector v is not unit. A unit vector is 


1 
u= — (3—2i,1+4 2%) 


v14 


more explicitly, 


u=((sa aT | ai) 


Notes. 


This appendix is inspired on Tom Apostol’s overview of complex numbers given in his 
outstanding Calculus textbook, [1], Volume I, § 9. 
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B. Overview of Power Series 


We summarize a few results on power series that we will need to find solutions to differential 
equations. A more detailed presentation of these ideas can be found in standard calculus 
textbooks, [1, 2, 11, 13]. We start with the definition of analytic functions, which are 
functions that can be written as a power series expansion on an appropriate domain. 


Definition B.1. A function y is analytic on an interval (ao —p, Xo +p) iff it can be written 
as the power series expansion below, convergent for |x — xo| < p, 


yl) = So an (a — 2)". 
n=0 


Example B.1. We show a few examples of analytic functions on appropriate domains. 


1 
(a) The function y(a) = i is analytic on the interval (—1,1), because it has the power 
—2£ 
series expansion centered at 2) = 0, convergent for || < 1, 


CoO 


1 
; FED Maleate tate, 
n=0 


It is clear that this series diverges for x > 1, but it is not obvious that this series 
converges if and only if |z| < 1. 


(b) The function y(%) = e® is analytic on R, and can be written as the power series 


c) A function y having at x) both infinitely many continuous derivatives and a convergent 
g y y i 
power series is analytic where the series converges. The Taylor expansion centered at 
Z of such a function is 


2 () (55 
ula) = > 9) (ay) 
n=0 . 
and this means 
ul) = u(0e) +f () (ea) + EP) (@— 20)? + (a 04)? + 


The Taylor series can be very useful to find the power series expansions of function 
having infinitely many continuous derivatives. 


Example B.2. Find the Taylor series of y(x) = sin(a) centered at 2) = 0. 


Solution: We need to compute the derivatives of the function y and evaluate these deriva- 
tives at the point we center the expansion, in this case x = 0. 


y(x) = sin(x) = y(0)=0, — y"(x) = cos(x) => y(0)=1, 
y" (x) = —sin(z) > y"(0)=0, y’"(x) = —cos(z) > y"(0) =-1. 
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One more derivative gives y“(t) = sin(t), so y“ = y, the cycle repeats itself. It is not 
difficult to see that the Taylor’s formula implies, 


3 5 co —1)r 
sin(x) =2——+——... > sin(a) = > go an, 


n=0 


Remark: The Taylor series at x) = 0 for y(x) = cos(x) is computed in a similar way, 


cos(x) = S- ‘Gat grr, 


n=0 


Elementary functions like quotient of polynomials, trigonometric functions, exponential 
and logarithms can be written as power series. But the power series of any of these functions 
may not be defined on the whole domain of the function. The following example shows a 
function with this property. 


centered at x) = 0. 


1 
Example B.3. Find the Taylor series for y(a) = 


Solution: Notice that this function is well 
defined for every x € R— {1}. The func 
tion graph can be seen in Fig. ??. To find 
the Taylor series we need to compute the n- 
derivative, y‘” (0). It simple to check that, 
n! 


yO) = (1 —ax)r*1’ 50 y (0) =n. 


1 —- n 
We conclude that: y(x) = ice = 27 : 
One can prove that this power series con- 
verges if and only if |a| < 1. <l 
1 
FIGuRE 1. The graph of y = : 
(1-2) 
Remark: The power series y(x) = S- x” does not converge on (—oo, —1]U[1, 00). But there 
n=0 


are different power series that converge to y(x) = 7 on intervals inside that domain. 
x 


For example the Taylor series about x) = 2 converges for |a — 2| <1, that isl<a< 3. 


M@)=_ B= M@=—e = wwe)= Cy e-2. 
, (ay : ene Wad de 


Later on we might need the notion of convergence of an infinite series in absolute value. 
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Co 
Definition B.2. The power series y(x) = » An (2 — 2%)" converges in absolute value 
n=0 


Co 
iff the series S- |an| |a — xo|" converges. 


n=0 
Remark: If a series converges in absolute value, it converges. The converse is not true. 


oa 


n 


co 
Example B.4. One can show that the series s = oe converges, but this series does 
n=1 


co 
I 
not converge absolutely, since s — diverges. See [11, 13]. <J 
nr 
n=1 


Since power series expansions of functions might not converge on the same domain where 
the function is defined, it is useful to introduce the region where the power series converges. 


co 
Definition B.3. The radius of convergence of a power series y(x) = > Gy, (4 — 2%)” 
n=0 


is the number p > 0 satisfying both the series converges absolutely for |x — zo| <p and the 
series diverges for |x — xo| > p. 


Remark: The radius of convergence defines the size of the biggest open interval where the 
power series converges. This interval is symmetric around the series center point Xo. 


diverges converges diverges 


Lo — p Xo Lo + P od 
FIGURE 2. Example of the radius of convergence. 


Example B.5. We state the radius of convergence of few power series. See [11, 13]. 


1 fore) 
(1) The series : = iy x” has radius of convergence p = 1. 


n=0 
(2) The series e” = :2 = has radius of convergence p = oo. 
n: 
n=0 
: : = (=i (2n+1) : 
(3) The series sin(a) = = n+)! x has radius of convergence p = co. 
Tr 7 


n=0 


lee) ah 
(4) The series cos(x”) = S- 5 : a") has radius of convergence p = 00. 
n)! 


— i 
(5) The series sinh(x) = + n+)! 2?"+) has radius of convergence p = oo. 
Tr : 


ae 
(6) The series cosh(x) = ¥ x?" has radius of convergence p = 00. 
TL): 
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One of the most used tests for the convergence of a power series is the ratio test. 
co 
Theorem B.4 (Ratio Test). Given the power series y(x) = S- An (x — %o)", introduce 
n=0 


the number L = lim Jana 
noo |an| 


. Then, the following statements hold: 


(1) The power series converges in the domain |x — xp|L < 1. 
(2) The power series diverges in the domain |x — xo|L > 1. 
(3) The power series may or may not converge at |x — xo|L = 1. 


il 
Therefore, if L £0, then p = — is the series radius of convergence; if L = 0, then the radius 
a= 


of convergence is p = 00. 


Remark: The convergence of the power series at %) + p and 2 — p needs to be studied on 
each particular case. 


Power series are usually written using summation notation. We end this review men- 
tioning a few summation index manipulations, which are fairly common. Take the series 


y(x) = ap +. a1 (x — 29) + ao(x — 29)* +-:-, 


which is usually written using the summation notation 
Co 
y(z) = Do an (2 — 20)”. 
n=0 


The label name, n, has nothing particular, any other label defines the same series. For 
example the labels k and m below, 


co co 
y(x“) = S- ay (2 — t9)* = S- dans (= 0)", 
k=0 m=—3 
In the first sum we just changed the label name from n to k, that is, k =n. In the second 
sum above we relabel the sum, n = m+ 3. Since the initial value for n is n = 0, then the 
initial value of m is m = —3. Derivatives of power series can be computed derivating every 
term in the power series, 


loc) co 
y' (x) = S- Nay (2 — a9)" * = > Nan (x — 29)" ! = ay + 2aa(a — ay) +--- 
n=0 n=1 


The power series for the y’ can start either at n = 0 or n = 1, since the coefficients have a 
multiplicative factor n. We will usually relabel derivatives of power series as follows, 


co co 
ye) = So nan (n—2)" = (m +1) dm4i (@ — %0)™ 
n=1 m=0 


where m = n—1, that is,n =m-+1. 
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C. Discrete and Continuum Equations 


A differential equation is an equation, where the unknown is a function and at least one of its 
derivatives appears in the equation. Differential equations are essential for a mathematical 
description of nature—they lie at the core of many physical theories. In this section we 
show that differential equations can be obtained as a certain limit of difference equations. 
We focus on a specific problem—a quantitative description of bacteria growth having 
unlimited space and food. We first measure the bacteria population at fixed time intervals, 
then we repeat the measurements at shorter and shorter time intervals. We write our 
measurements in a difference equation for a discrete time interval variable. We solve this 
difference equation, obtaining the bacteria population as a function of the initial population 
and the number of time intervals passed from the start of the experiment. We then compute 
a very particular limit on the difference equation, called the continuum limit. In this limit 
the time interval goes to zero and the number of time intervals goes to infinity so that their 
product remains constant. We will see that the continuum limit of the difference equation 
in this section is a differential equation, called the population growth differential equation. 


C.1. The Difference Equation. We want to know how bacteria grows in time when 
they have unlimited space and food. To obtain such equation we observe—very carefully— 
how the bacteria grows. We put an initial amount of bacteria in a small region at the 
center of a petri dish, which is full of bacteria nutrients. In this way the bacteria population 
has unlimited space and food to grow for a certain time. The bacteria population is then 
proportional to the area in the petri dish covered in bacteria. With this setting we will 
perform several experiments in which we measure the bacteria population after regular time 
intervals. 


ro, Beclen'n al Boclerca al 
Loewe t 


Zeme 7440. 


FIGURE 3. Bacteria growth experiment with unlimited food and space. 


First Experiment: 


e fix the time interval between measurements by At; = 1 hour. 
e denote the bacteria population after n time intervals as P(nAt,) = P(n), 
e introduce the initial bacteria population P(0), 


Our first measurement is P(1), the bacteria population after 1 hour. It is convenient to 
write P(1) as follows 


P(1) = P(0) + AP, 
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where AP, is what we actually have measured, and it is the increment in bacteria population. 
In the same way we can write our first n measurements, 


P(1) = P(0) + AP, 
P(2) = P(1) + APy, 


P(n) Z P((n—1)) + AP,, (C.1) 


where AP;, for j = 1,---+ ,n, is the increment in bacteria population at the measurement 
j relative to the measurement 7 — 1. If you actually do the experiment—and if you look 
carefully enough at the AP,, carefully enough—you will find the following: The increment in 
the bacteria population AP,, is not random, but it follows the rule 


AP, = K, P(n— 1), (C.2) 


where ky depends on the type of bacteria and on the fact that we are measuring by At, = 1 
hour. This last equation means that the growth of the bacteria population is proportional 
to the existing bacteria population. We use Eq. (C.2) in Eq. (C.1) and we get the formula 


P(n) = P(n—-1)+ ki P(n- 1), = 1 Bens NV, (C.3) 


where N is the last time we measure, probably when the bacteria population fills the whole 
petri dish. This is the end of our first experiment. 


Second Experiment: We reduce the time interval At when we take measurements. Now 
At = 30 minutes, that is, At2 = 1/2 hours. Since At, is no longer 1, we need to include 
it in the argument of P. If you carry out the experiment, you will find that Eq. (C.3) still 
holds for this case if we introduce Atg. = At,/2 as follows, 


P(nAt2) = P((n — 1)Atz) + Ko P((n—1)Ate), = n= 1,2,---,N. (C4) 


In this experiment we have to measure the new constant AK». You will find that Ky = K,/2. 
This is reasonable, the bacteria population grows in 80 minutes half it grows in one hour. 
This is the end of our second experiment. 


m-th Experiment: We now carry out many more similar experiments. For the m-th 
experiment we use a time interval At,,, = At; /m, where At; = 1 hour. If you carry out all 
these experiments, you will find the following relation, 


P(nAtm) = P((n— 1)Atm) + Km P((n — 1)Aty), (ae eee (C.5) 
where K,, = Ki/m. 


By looking at all our experiments, we can see that the constant K,, is in fact proportional to 
the time interval At, used in the experiment, and the proportionality constant is the same 
for all experiments. Indeed, 


ky 


Ky At 
Km = > Km = a 
m 


vara Km = Atm, 
At, m " 


where the constant r = K,/At, depends only on the type of bacteria we are working with. 
Since the constant K,, in any of the experiments above is proportional to the time interval 
At used in each experiment, we can simplify the notation and discard the subindex m, 


K=rAJt. 
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Then, the final conclusion of all our experiments is the following: the population of bacteria 
after n time intervals At > 0 is given by the equation 


P(nAt) = P((n — 1)At) + r At P((n — 1)At), (C.6) 


where r is a constant that depends on the type of bacteria studied and n = 1,2,---. This 
equation is a difference equation, because the argument of the population function takes 
discrete values. We call equation (C.6) the discrete population growth equation. The physical 
meaning of this constant r is given in the equation above, 


AP 1 
r=— = 
At P 
where AP = P(nAt) — P((n — 1)At) and P = P((n — 1)At). So r is the rate of change in 


time of the bacteria population per bacteria, that is, a relative rate of change. 


C.2. Solving the Difference Equation. The difference equation (C.6) relates the 
bacteria population after n time intervals, P(nAt), with the bacteria population at the pre- 
vious time interval, P((n — 1)At). To solve a difference equation means to find the bacteria 
population after n times intervals, P(nAt), in terms of the initial bacteria population, P(0). 
The difference equation above can be solved, and the result is in the following statement. 


Theorem C.1. The difference equation 
P(nAt) = P((n — 1)At) + r At P((n — 1)At), 
relating P(nAt) with P((n —1)At) has the solution 
P(nAt) = (1+r At)” P(0), (C.7) 


relating P(nAt) with P(O). 


Proof of Theorem C.1: Eq. (C.6) can be rewritten as 
P(nAt) = (1+7r At) P((n— 1)Ad), 
but we can also rewrite the expression for P((n — 1)At) in a similar way, 
P((n— 1)At) = (1+ r At) P((n— 2)At), 
and so on till we reach P(0). Therefore, 


P(nAt) = (1 +r At) P((n— 1)At) 
= (1+ r At)? P((n— 2)At) 


= (1+rAt)” P(0). 
So, we have solved the discrete equation for population growth, and the solution is 


P(nAt) = (1+r At)” P(0). 


This establishes the Theorem. 
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C.3. The Differential Equation. We want to know what happens to the difference 
equation (C.6) and its solutions (C.7) in the continuum limit: 


At—+0, nAt=t>0_ is constant. 


We call it the continuum limit because At — 0, so we look more and more often at the 
bacteria population, and then n — oo, since we are making more and more observations. 
Rather than doing an experiment to find out what happens, we work directly with the 
discrete equation that models our bacteria population. 


Theorem C.2. The continuum limit of the discrete equation 
P(nAt) = P((n — 1)At) +r At P((n — 1)At), 


is the differential equation 


Pyar Pe: (C.8) 


Remark: The equation (C.8) is a differential equation because both P and P’ appear in 
the equation. It is called the exponential growth differential equation because its solutions 
are exponentials that increase with time. 


Proof of Theorem C.2: We start renaming n as n+ 1, then Eq. (C.6) has the form 
P((n+1)At) = P(nAt) + r At P(nA). 
From here it is simple to see that 
P(nAt + At) — P(nAt) = r At P(nA). 
We now use that n At = t, then the equation above becomes 
P(t + At) — P(t) =r At P(t). 
Dividing by At we get 
P(t + At) — P(t) 


At 


The continuum limit is given by At > 0 and n > oo such that n At = t is constant. For 
each choice of t we have a particular limit. So we take such limit in the equation above, 


=r P(t). 


_ P(t+At)— P(t) _ 
aoe 


Since t is held constant and At — 0, the left-hand side above is the derivative of P with 
respect to ft, 


a Pet At) = Plt) 


This establishes the Theorem. 
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C.4. Solving the Differential Equation. We now find all solutions to the exponen- 
tial growth differential equation in (C.8). By a solution we mean a function P that depends 
on time such that its derivative is r times the function itself. 


Theorem C.3. All the solutions of the differential equation P’(t) =r P(t) are 
P(t) = Po eo (C.9) 


where Po is a constant. 


Remark: The constant Pp in (C.9) is the initial population, P(0) = Pp. 

Proof of Theorem C.3: To find all solutions we start dividing the equation by P, 
P(t) _ 
P(t) 


We now integrate both sides with respect to time, 


P\(t) 
dt = dt. 
/ P(t) / ’ 
The integral on the right-hand side is simple to do, we need to integrate a constant, 


P(t) 
/ Pit) dt =rt+co, 


where co is an arbitrary constant. On the left-hand side we can introduce a substitution 
p=P@) => dp=P'(t)dt. 
Then, the the equation above becomes 


d 
ee em 
Pp 


The integral above is simple to do and the result is 
In |p| = rt + co. 


We now replace back p = P(t), and we can solve for P, 


In|P(t)|=rt+e => |P(t)| =e =e eo > P(t) = (te%)e™. 


We denote c = (+e°°), then all the solutions to the exponential growth equation, 
PU) =¢e™, ceER. 


The constant c is the initial population. Indeed, given an initial population Po, called an 
initial condition, then it fixes the constant c, because 


Py = P(0) =ce®°=c c= Pp. 


Then the solution of the differential equation with an initial population Pp is 


P(t) = Poe™. 


This establishes the Theorem. 


Remark: We see that the solution of the differential equation is an exponential, which is 
the origin of the name for the differential equation. 
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C.5. Summary and Consistency. By carefully observing how bacteria grow when 
they have unlimited space and food we came up with a diference equation, Eq. (C.6). We 
were able to solve this difference equation and the result was Eq. (C.7). We then studied 
what happened with the difference equation in the continuum limit—we look at the bacteria 
at infinitely short time intervals. The result is a differential equation, the exponential growth 
differential equation (C.8). Recalling calculus ideas we were able to find all solutions of this 
differential equation, given in Eq. (C.9). We can summarize all this as follows, 


Discrete description At > 0 Continuous description 
P(nAt) = (1+7r At) P((n— 1)At) = P=" PG} 
1 1 
Soving the equation Solving the equation 
1 1 
P(nAt) = (1+ r At)” Py a P(t) = Pye™ 


We are now going to show the consistency of the solutions. We have a solution of the 
discrete equation, we have a solution of the continuum equation, and now we show that the 
continuum limit of the former is the latter. 


Theorem C.4 (Consistency). The continuum limit of the solutions of the difference equa- 
tions are the solutions of the differential equation, 


P(nAt) =(1+rAt!"PR —- P(t)=Pe™. 


Proof of Theorem C.4: We start with the discrete solution given in Eq. (C.7), 
P(nAt) = (1+ rAt)” Po, (C.10) 


and we recall that t = nAt, hence At = t/n. So we write 


Now we need to study the limit of the expression above as n — oo while ¢ is constant in 
that limit. This is a good time to remember the Euler number e, 


. 1\” 
e= lim (1+_) 
n 


noo 
satisfies that 
x n 
e” = lim (1+ =) : 
n—- co nm 
Using the formula above for 7 = rt we get 
t\ nr 
lim (1 + ~) =e, 
n 


n—->oco 


With all this we can write the continuum limit as 


t n 
P(t) = lim (1 a: ~~) Py =e" B 
n> Co n 
But the function 
P(t) = Poe™ 


is the solution of the differential equation obtained using methods from calculus. This 
establishes the Theorem. 
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In the following examples we provide a table with data from different physical systems. 
Then we find the difference equation that describes such data and its solution. After that 
we compute the continuum limit, which gives us the differential equation for that system. 
We finally solve the differential equation. 


Example C.1. The population of bees in a state, given in thousands, is given by 


Year 2000 | 2002 | 2004 | 2006 | 2008 | 2010 
Population | 2 10 50 | 250 | 1250 | 6250 


Model these data using exponential growth model, denoting by P(t) the bee population 
in thousands at time t, with time in years since the year 2000. For example, for the year 
2008, the variable t is 8. Consider a discrete model for the data in the table above given by 


P((n+ 1)At) = P(nAt) + k At P(nAt). 


(1) Determine the growth-rate coefficient & using the data for the years 2000 and 2002. 

(2) Determine the growth-rate coefficient & again, this time using the data for the 
years 2008 and 2010. 

(3) Use the value of k found above to write the discrete equation describing the bee 
population. Write At instead of the time interval in the table. 

(4) Solve the discrete equation for the bee population. 

(5) Find the continuum differential equation satisfied by the bee population and write 
the initial condition for this equation. 

(6) Find all solutions of the continuum equation found in part (5). 


Solution: 
(1) The growth coefficient computed using the years 2000 and 2002 is 
(10-2) 1 


pee te es k = 2. 
(2002 — 2000) 2 


(2) The growth coefficient computed using the years 2008 and 2010 is 
(6250-1250) 1 


(2010 — 2008) 1250 


(3) We now use k = 2 and At arbitrary to write the discrete equation that describes 
the data in the table. We denote 


P(n+1) = P((n+ 1)At), Pn = P(nAt), 
then, the discrete equation is 
P(n+1) = Pn+rAtPn, 


which is the analogous to Eq. (C.6). 
(4) Since 


Pn=(1+rAt) P(n—1), 
P(n—1)=(14+rAt) P(n- 2), 


=> Pn=(1+rAt) P(n—2), 


repeating this argument till we reach Po we get 


Pn =(1+rAt)” Po. 
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(5) The continuum equation is obtained from the discrete equation taking the contin- 
uum limit: 
At—>0, n—-+oco suchthat nAt=teR. 
Using the discrete equation in (3) we get 


P 1)-—P 
P(n+1)—Pn=rAtPn => Pons Pn + Pn. 


If we write what P(n +1) and Pn actually are, we get 
P(nAt + At) — P(n At) 


© =r P(nAt). 
Since n At = t, we replace it above, 
P(t + At) — 
ama eel ey 31) 


At 
Since At > 0 we get the continuum equation 


P’(t)=r P(t). 


(6) To solve the continuum equation we rewrite it as follows, 


P Pt 
par i ar = frat => In(|P|)=rt+o, 
where ¢ € R is an arbitrary integration constant, and we In(|P|)’ = P’/P. Then, 
P(t) = +e =+e%e™, denote qo =te® => Plt)=aqe", qeER. 


The constant c; is determined by the initial population P(0) = Py. Indeed 
Po P(O) Cy e? Cy Cy Po 


therefore we get that 


Example C.2. A bacteria population increases by a factor (1+ 8 At) in a time period At. 
Every At we harvest an amount of bacteria 20 At. 


(a) Write the discrete equation that relates the bacteria population at (n + 1) At with 


the bacteria population at n At. 


(b) Find the continuum limit in the discrete equation found in part (a) above. Recall 


that the continuum limit is n — oo and At > 0 so that n Dt = t is constant in that 
limit. 


(c) Solve the differential equation in part (b) in the case there is an initial population of 


100 bacteria. 


Solution: 


(a) We know that the bacteria population P increases by a factor (1+ 8 At during the time 


interval At. If we forget that we harvest bacteria, then after (n+1) time intervals 
the bacteria population is 


P((n+1)At) = (1+ 8At) P(n At). 
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The equation above does not include the fact that we harvest 20 At bacteria every time 
interval At. If we include this fact we get the equation 


P((n + 1)At) = (1+ 8 At) P(n At) — 20 At, 


where the negative sign in the last term indicates we reduce the population by that 
amount when we harvest. 


(b) The continuum limit is computed as folllows: If we see a product n At, we replace it by 


t, that is, 


P(nAt + At) =(14+8At)P(nAt)-20At ©— P(t+At) =(1+8At) P(t) —20At. 


WN 


We now reorder terms such that we get an incremental quotient on the left-hand side, 
P(t + At) — P(t) 
At 
Now we take the limit At + 0 keeping ¢ constant, and we get the continuum equation 

P'(t) = 8 P(t) — 20. 


We now need to solve the differential equation above. We do the same calculation we 
did in the case of zero harvesting. 


P(t + At) — P(t) = 8 At P(t) — 20 At 


= 8 P(t) — 20. 


P'(t) 
P'(t) =8 P(t) — 20 or dt = | dt. 
) (6) ~ @&PH-20)  ~ laa — 20) i 
On the left-hand side above we substitute u = 8 P(t) — 20, so du = 8 p’(t) dt. Then, 
1d 
[azn fe => j mlul=t+er =>  In|8 P(t) — 20| = 8t + 8c. 
U 


We compute the exponential of both sides, 
[8 P(é) — 20| = eS# +81 — e842 => 8 P(t) — 20 = (He®) e®™*. 
If we call cp = (+e8*'), we get that 


2 
BP(t)- =e + Plt)=Se+ ~ 
and again relabeling the constant c = c2/8 we get that 
5 
P(t) =ce®* + 5" 


We know that at time t = 0 we have P(0) = 100 bacteria, which fixes the constant c, 
because 


5) 5 5 195 
1 =P = e =o = —rall --= —. 
00 (0) =ce +5 e+ 5 => c=100 ; 5 
So the continuum formula for the bacteria population is 
195 g 5 


C.6. 
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Exercises. 


9.3.1.- The fish population in a lake, given in hundred thousands, is given by 


| Year 2000 | 2001 | 2002 ) 2003 2004 2005 
| Population 3 4.5 | 6.75 | 10.125 | 15.1875 | 22.78125 


Model these data using exponential growth model, denoting by P(t) the fish population 


in hundred thousands at time t, with time in years since the year 2000. For example, for the 
year 2005, the variable t is 5. Consider a discrete model for the data in the table above given 


by 


P((n + 1)At) = P(nAt) +k At P(nAt). 


Determine the growth-rate coefficient k using the data for the years 2000 and 2001. 
Determine the growth-rate coefficient k again, this time using the data for the years 2004 
and 2005. 

Use the value of k found above to write the discrete equation describing the fish popula- 
tion. Write At instead of the time interval in the table. 

Solve the discrete equation for the fish population. 

Find the continuum differential equation satisfied by the fish population and write the 
initial condition for this equation. 

Solve the the initial value problem found in the previous part. 

Use a computer to compare the solutions to the discrete equation (with any At 4 0) and 
continuum equation for the fish population. 


9.3.2.- A bacteria population increases by a factor r in a time period At. Every At we harvest an 
amount of bacteria P, At, where P», is a fixed constant. 


(a) 
(b) 


Write the discrete equation that relates the bacteria population at (n + 1) At with the 
bacteria population at n At. This equation is similar, but not equal, to Eq. (C.6) above. 
Find the continuum limit in the discrete equation found in part (a) above. Recall that 
the continuum limit is m > co and At > 0 so that n At = t is constant in that limit. 
Denote by P(t) the bacteria population at the time t. 

Solve the differential equation in part (b) in the case there is an initial population of Pp 
bacteria. 

The solution of the continuum differential equation in part (b) above also holds in the 
case that the initial population of bacteria is smaller than P,,/r. So, consider the case 
where Po < P,/r and find the time t; such that the bacteria population vanishes. 


9.3.3.- The amount of a radioactive material decreases by a factor r = 1/2 in a time period At. 


(a) 


(b) 
(c) 


Write the discrete equation that relates the amount of radioactive material at (n +1) At 
with the radioactive material at n At. This equation is similar, but not equal, to Eq. (C.6) 
above. 

What is the main difference between a radioactive decay system and a bacteria population 
system? 

Take the continuum limit in the discrete equation found in part (a) above. Recall that 
the continuum limit is n — oo and At > 0 so that n At = t is constant in that limit. 
Denote by N(t) the amount of radioactive material at the time ¢. You should obtain the 
radioactive decay differential equation. 
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(d) Solve the radioactive decay differential equation. Denote by No the initial amount of the 
radioactive material. ae 

(e) The half-life of a radioactive material is the time 7 such that N(7) = — Find the 
half-life of radiative material in this problem. Find an equation relating the half life 7 
with the radioactive decay constant r. 
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D. Review Exercises 
Coming up. 
E. Practice Exams 


Coming up. 
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F. Answers to exercises 


Chapter 1: First Order Equations 


Section 1.1: Linear Constant Coefficient Equations 


Ld- y! =5y +2. 
1.1.2.- a=2andb=3. 
1.1.3.- y=ce*™, force R. 


1.1.4.- y(t) =ce* 4 - with ce R. 
1.1.5.- y(t) =ce* — : 

9 —At 1 z 
1.1.6.- y(z) = 5° + 3? with ce R. 


1.2.1.- y(t) = ce. 
1.2.2.- y(t) =ce'—e~*, with cE R. 
1.2.3.- y(t) = 2e’ + 2(t — 1) e”*. 


a  cos(t) 
1.2.4.- ‘j= = - 
WI sa a 
1.2.5.- y(t) = cet (42), with cE R. 
1.2.6.- y(t) = Ee net 
2.6.- y(t) = 79 + aa With CER. 


1.2.7.- y(t) =3e". 


1.2.8.- y(t) = ce’ +sin(t) + cos(t), for 
allc ER. 


Section 1.3: Separable Equations 


3 


1.3.1.- Implicit form: v = 5 +c. 
Explicit form: y = + 7 + 2c. 
1.3.2.- y4+y+8—t=c, withceR. 
1.3.3. y(t) =, _ 

1.3.4.- y(t) = cee, 


1.3.5.- y(t) =t In(|t|) +c). 


1.1.10.- * Not given. 


Section 1.2: Linear Variable Coefficient Equations 


1.2.9.- y(t) = —¢? + # sin(4t). 
1.2.10.- Define v(t) = 1/y(t). The 
equation for v is v’ = tu—t. Its solution 
is v(t) = ce’ 24.1, Therefore, 


1 
y(t) — cet?/2 Z 


N 
1.2.11.- y(#) = 6tee* /*) 
a, 
1.2.12.- y(x)= 4e%— 3) 


1.2.13.- * Not given. 


2 a * 
1.3.6.- y?(t) = 2¢? In((t|) +c). 
1.3.7.- Implicit: y? + ty — 2t = 0. 
Explicit: y(t) = - —t+ Vt? + 8t). 
1.3.8.- Hint: Recall the Defini- 
tion 1.3.4 and use that 

; S 

yi (x) = f x, yi(x)), 
for any independent variable x, for ex- 
ample for x = kt. 


1.3.9.- * Not Given. 
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Section 1.4: Exact Equations 


1.4.1.- 


(a) The equation is exact. N = (1+#?), 
M = 2ty, so ON = 2t=d,M. 

(b) Since a potential function is given 
by W(t, y) = t? y +y, the solution is 


Cc 
iS R. 
y(t) Pai °& 
1.4.2.- 


(a) The equation is exact. We have 
N =tcos(y) — 2y, M =t+sin(y), 


O,N = cos(y) = OyM. 
(b) Since a potential function is given 


? ; 
by w(t.y) = > + tsin(y) - y’, the 


solution is 
2 


t : 

z + tsin(y(t)) — 9°) =e, 
for cE R. 

1.4.3.- 


(a) The equation is exact. We have 
N=—2y+te”, M=2+ye, 


O.N = (1 + ty) ely = OyM. 

(b) Since a potential function is given 
by w(t, y) = 2t+e — y’, the solu- 
tion is 

at + eh _ y(t) =c, 
for cE R. 


da 
(a) u(x) = ns 

(b y° —3ay+ =o? = 1. 
1.4.5.- 


(a) p(x) = 2. 
(b) y?(x* + 1/2) = 2. 


(c) y(x) = ae The negative 


square root is selected because the 
the initial condition is y(0) < 0. 
1.4.6.- 


(a) The equation for y is not exact. 
There is no integrating factor de- 
pending only on x. 

(b) The equation for x = y~' is not ex- 
act. But there is an integrating fac- 
tor depending only on y, given by 


u(y) =e". 
(c) An implicit expression for both y() 
and x(y) is given by 


—3re " + sin(5x) e” =, 
for cE R. 
1.4.7.- * Not Given. 
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Section 1.5: Applications 


1.5.1.- 


(a) Denote m(t) the material mass as 
function of time. Use m in mgr and 
t in hours. Then 

m(t) = moe *, 


where mo = 50 mgr and k = In(5) 


hours. ‘ 
(9) mle) = 3 me 
(c) T= es hours, so 7 ~ 0.43 hours. 
1.5.2.- 
(a) We know that (AT)! = —k (AT), 


where AT = T — Ts and the cooler 
temperature is T, = 3 C, while k is 
the liquid cooling constant. Since 
T; = 0, 
T’' =—k(T —3). 
(b) The integrating factor method im- 
plies (T’ +k T)e™ = 3k e*", so 
rey - 3eM)! =0. 
Integrating we get (T — 3)e** = 
c, so the general solution is T = 
ce *'43. The initial condition im- 
plies 18 = T(0) = c+3, soc = 15, 
and the function temperature is 
T(t) =15e +3. 


(c) To find k we use that T'(3) = 13 C. 
This implies 13 = 15 e~** +3, so we 


arrive at 
3, 13-3. 2 
e = =, 
15 3 
which leads us to —3k = In(2/3), so 
we get 
ae 5 in(3/2). 


1.5.3.- Since 

Q(t) = Que Wo! VOM, 
the condition 

Q1 = Qoe7(2/Vo)t 
implies that 


Therefore, t; = 20 In(5) minutes. 
1.5.4.- Since 
Q(t) =Voqa 1- area") 
and 
too 
the result in this problem is 
*% —t/50 

Q(t) = 300 1-e ) 
and 

lim Q(t) = 300 grams. 

too 


1.5.5.- Denoting Ar = r; — ro and 
V(t) = Art + Vo, we obtain 


aw = [PE] e 
+0 [veo vf] * 


A reordering of terms gives 
_ Vo |B, 
Q(t) = aV (6) ~ [Zap] *” (avo ~ Qo) 
and replacing the problem values yields 
(200)? 
(t + 200)?” 
The concentration q(t) = Q(t)/V(t) is 
fw eye 
a) =a-[ppl” (@- %): 
The concentration at V(t) = Vin is 
Vo art Qo 
Val (#- %): 
which gives the value 


Q(t) = t + 200 — 100 


Gm = 9 — | 


eae rams/liter 
qm = 125 & : 


In the case of an unlimited capacity, 
limt+oo V(t) = 00, thus the equation for 
q(t) above says 

lim q(t) = qi. 


too 
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Section 1.6: Nonlinear Equations 


1.6.1.- 
Yo = 0, 
y= t, 
yo =t+ 30°, 
ys =t+ 30? + 60. 
1.6.2.- 
yo = 1, 
yr = 1+ 8b, 
(a) 


yo = 1+ 8t+ 122, 
Ya = 1486+ 128 +124. 


1.6.3.- 

(a) Since y = \/y3 — 4t?, and the ini- 
tial condition is at t = 0, the solu- 
tion domain is 


. Yo 
(b) Since y= 1— fy 


condition is at t = 0, the solution 
domain is 


and the initial 


1.6.4.- 
(a) Write the equation as 
,__ 2In(t) 
~ =a" 
The equation is not defined for 


t=0 t=2. 


This provides the intervals 
(—00, —2), (-2, 2), (2, oo). 


Since the initial condition is at t = 
1, the interval where the solution is 
defined is 


D = (0,2). 
(b) The equation is not defined for 
t=0, t=3. 
This provides the intervals 
(—o0,0), (0,3), (3,00). 


Since the initial condition is at t = 
—1, the interval where the solution 
is defined is 


D = (—oo, 0). 
1.6.5.- 
bi ge =t 
(b) Outside the disk t? +y? <1. 
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Chapter 2: Second order linear equations 


Section 2.1: Variable Coefficients 


2.01... 2.1.2.- 
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Section 2.2: Reduction Order Methods 


2.2.1.- 

(a) vo! = —(3/t)v + 3/t?. 
(b) v(t) = 3/(2t). 

(c) y(t) = (38/2 In(t) + 3. 


2e212s= 

(a) w= —3u/y 

(b) w(1) =5. 

(c) wly) =5/y%. 

(d) y(t) = 5/(y(@))’, with y(01) = 1. 
(e) y(t) = (20t+ 1)", 


2:2.3.- y(t) = 74/6 +1)". 
2.2.4.- yo(t) = c/t*, with cE R. 
2.2.5.- * Not given. 


Section 2.3: Homogeneous Constant Coefficient Equations 


2.3.1.- 

(a) r+ = 4, r- =3. 

(b) y(t) = et, y-(t) =e. 
(c) y(t) = —4e*% 4+ 5e?!. 


2.3.2.- 
5 
(a) r+ =44 31, r-=4- 31 
(b) y(t) = e** cos(3t), 
y-(t) = e* sin(3t). 
(c) y(t) = 2e**(cos(3t) — sin(3t)). 


Section ??: Repeated Roots 


27.27. 


Section ??: Undetermined Coefficients 


27.77. 


Section ??: Variation of Parameters 


77.277. . 


2.3.3.- 

(a) mr =r-=3. 

(b) y+(t) =e, y-(t) = te*. 
(c) y(t) =e"(1 +2). 


2.3.4.- * Not given. 


27.77. 


Ue Goss, 


27.77. 
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Chapter 3: 
Section 3.1: 


3.1.1.- 
Section 2.4: 

2.4.1.- 
Section 3.2: 


3.2.1.- 
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Power Series Solutions 


Regular Points 


The Euler Equation 


Regular-Singular Points 


3.1.2.- 


2.4.2.- 


3.2.2.- 
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Chapter 4: The Laplace Transform 


Section 4.1: Introduction to the Laplace Transform 


4.1.1.- 
1 1 

wo-(4-3) 
(a) In iN 4 
b) I=-—. 
b) r= 4 
4.1.2.- 

SNS 

a) In =—= e- —e. : 
( ) I 1 sN ry 
(b) I= , for s > 0. I diverges for 

s<0 
4.1.3.- 

: 1 s 2)N 
a N =o © = . 
( ) I (s—2) et 1) 

s) = —s, fors>2. 
b) F cy f 2 
4.1.4.- 
(a) 
ree N eT St2N 1 “S428 _4) 
(s+ 2) (s+ 2)? : 

(b) F(s) => a for s > 2. 


Section 4.3: Discontinuous Sources 


4.3.??.- 


Section 4.4: Generalized Sources 


4.4.1.- * Not Given. 


Section 4.5: Convolution Solutions 


77.77. 


4.1.5.- 
(a) 

82 es 
In =— ae (- sin(2N) + 2 cos(2N)) 

2 
+ 3 
(b) F(s) = ay for s > 0. 
4.1.6.- 
(a) 
s?e 8X 4, 
In ar cos(4N) + 5 sin(4N)) 
8 
s? + 42° 
8 

(b) F(s) = aoe for s > 0. 
4.1.7.- 7 
L{sinh(at)] = ge? for s > jal. 
4.1.8.- * 2 
L{cosh(at)] = og for s > |a|. 
4.3.2??.- 
4.4.2.- 


27.77. 


BeatriceGloria_personal library 
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Chapter 5: Systems of Linear Differential Equations 


Section ??: Introduction 

eg ena 22.27. 
Section 8.1: Systems of Algebraic Equations 

8.1.1.- . 8.1.2.- 
Section 8.2: Matrix Algebra 

8.2.1.- . 8.2.2.- 
Section 5.1: Linear System of Differential Equations 

5.1.77.- . 5.1.77.- 
Section 8.3: Diagonalizable Matrices 

8.3.1.- . 8.3.2.- 
Section ??: Constant Coefficients Systems 


77.77. . 22.77 


Chapter 7: 
Section 7.1: 


i Lee 
Section 7.2: 

7.2.1.- 
Section 7.3: 


7.3.1.- 


F. ANSWERS TO EXERCISES 


Boundary Value Problems 


Eigenvalue-Eigenfunction Problems 


7.1.2.- 
Overview of Fourier Series 

.2.2.= 
Applications: The Heat Equation 

7.3.2.- 
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